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In this Issue 

The voice on the telephone one day last year was HP statistician Tim 
Read's. Pointing out that not all technical contributions are made by en- 
gineers, he made us an offer we couldn't refuse — to invite a group of HP 
statisticians to submit papers on applications of statistics. The results are 
five excellent papers that cover an impressive range of topics: designing 
electronic hardware, calibrating instruments, setting specifications, estimat- 
ing reliability, and managing software projects. In "Statistical Issues in Setting 
Product Specifications," page 6, we learn how the science of statistics helps 
ensure that specifications are useful for describing the product's perfor- 
mance, relevant for competitive comparisons, measurable and verifiable, and valid over the 
required environmental ranges and time intervals. This paper takes a tutorial approach to its 
subject. "Circuit Design Using Statistical Data Analysis," page 12, tells how measurements on 
prototype amplifiers have been analyzed statistically to guide the design of adjustment-free 
amplifier boards. "Statistical Calibration of a Vector Demodulator," page 18. describes the design 
of the statistical demodulator calibration algorithm that is programmed into the HP 8981 A Vector 
Modulation Analyzer. The algorithm can be used to calibrate not only the built-in demodulator but 
also external demodulators. "An Availability and Reliability Tool for Computer Systems," page 
26, describes a software program that computes system availability based on reliability data for 
the system components. The program helps R&D engineers make trade-offs in designing for 
reliability. "Project Management Using Software Reliability Growth Models," page 30, relates one 
HP Division's experience with a model-based method for predicting the duration of software 
system testing. The model also helps management decide when to release a product. 

Half-inch reel-to-reel tape recorders are widely used for archival storage of computer data, 
backing up data stored on disc memory, and interchanging data between systems. The HP 7980A 
Tape Drive, besides being faster and more reliable than its predecessors, is interesting because 
it's an autoloading drive. Designed to mount in a rack with the reels and tape path in a horizontal 
plane rather than the more conventional vertical plane, it takes up much less rack space, but its 
reels and tape path aren't readily accessible for the user to mount reels and thread tape. However, 
no problem. All the user need do is open the door and slip in a reel of tape. The HP 7980A 
automatically threads the tape onto the takeup reel. How does it do that? Its designers tell how 
in the article on page 36. The drive's state-of-the-art control electronics (page 43) provide features 
such as variable-velocity rewind, which chops 25% off the 120 seconds it used to take to rewind 
a 2400-foot reel of tape. 
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In the February 1988 issue, we presented the design story of a family of digitizing oscilloscopes 
based on waveform recorder technology. The primary objective in these instruments design was 
maximum fidelity within a given bandwidth. Featured this month is a digitizing oscilloscope designed 
for maximum speed, which means maximum bandwidth for capturing single-shot phenomena. 
The key element in the HP 54111D Oscilloscope is the advanced one-gigasample-per-second 
analog-to-digital converter (ADC) shown on the cover. Thanks to this ADC's speed, the HP 541 1 1 D 
can capture and display signals containing frequencies up to 250 megahertz with a single look. 
This wide single-shot bandwidth is needed for digital system diagnostics, high-energy physics, 
transient analysis, and other applications. After a brief introduction (page 58) by Joe Millard, who 
was project manager for the HP 54111D's development, three authors from HP Laboratories 
describe the fast-ADC technology (page 59). Four digitizers are interleaved in this design, which 
uses gallium arsenide, silicon bipolar, and silicon NMOS integrated circuits and thick-film hybrid 
circuit technology. The wideband hybrid preamplifier and attenuator circuits at the instrument's 
input are described in the article on page 67. Filtering, bandwidth, noise, and waveform reconstruc- 
tion issues are discussed in the paper on page 70, which shows how the six-bit ADC can provide 
seven or eight bits of information for sufficiently low-frequency input signals. 

-R.P. Dolan 



What's Ahead 

In the August issue, eight articles will discuss the design of the HP PaintJet Color Graphics 
Printer, with particular emphasis on the design and manufacturing of the print cartridge. Aluminum 
gallium arsenide (AIGaAs) red light-emitting diode lamps are the subject of another article. HP-RL, 
the Hewlett-Packard Representation Language, was used for expert systems research in HP 
Laboratories until recently. We'll have a paper summarizing its contributions and another paper 
on MicroScope, an expert system for program analysis, which was originally written using HP-RL. 
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Statistical Issues in Setting Product 
Specifications 

A primer on the use of statistics in specification setting. 
by Sherry L. Read and Timothy R.C. Read 



PRODUCT SPECIFICATIONS are frequently the first 
chance that a potential customer has to evaluate a 
new product. The purpose of such specifications is 
to characterize the product's performance and to warrant 
a minimum performance level to the customer. Con- 
sequently, it is extremely important that the specifications 
accurately reflect the quality and performance of the prod- 
uct. Conservative specifications that downplay the true per- 
formance are likely to benefit competitors selling similar 
products who provide a more realistic assessment of the 
specifications. On the other hand, specifications that are 
too ambitious for the product are likely to cause high man- 
ufacturing cost and possible customer dissatisfaction. 

The choice of product specifications is driven by cus- 
tomer needs, product design, competition, and manufactur- 
ing capabilities. Final setting of specifications should ac- 
count for all these aspects, as well as for statistical evidence 
of product performance from a production pilot run. In 
particular, the process of setting specifications needs to 
reflect the accuracy and repeatability of a single production 
unit and the variation between different units. 
Examples of typical product performance questions are: 
With what level of certainty will the harmonic distortion 
of this signal generator lie within a certain limit? 
With what confidence will the noise power of a single 
100-MHz crystal oscillator vary by less than 10 percent 
over a two-week period? 
o What is the probability that a product meeting test 
criteria at room temperature will meet or exceed specifi- 
cations over all warranted environmental conditions? 
In this paper we illustrate how statistical methods of 
data analysis can help product designers, manufacturing 
and marketing engineers provide efficient and accurate an- 
swers to such questions. 

Objectives 

What are the objectives for the process of setting specifi- 
cations? 

Specifications should be: 

Useful in describing product performance to customers 

and field offices 
a Relevant for competitive comparisons 
n Measurable and easily verifiable in production, at service 

centers, and by customers 
« Valid over the appropriate environmental conditions 

and time intervals. 

To meet these objectives and provide maximum informa- 
tion to the customer, product specifications are frequently 
presented in a two-tier structure.* The top tier describes 



warranted specifications and the second tier describes sup- 
plemental characteristics. The supplemental characteris- 
tics provide information useful in product applications by 
giving typical, but nonwarranted, performance parameters 
for the laboratory environment. In this paper, we shall con- 
centrate on the setting of warranted specifications. How- 
ever, the same general methods can be applied to supple- 
mental characteristics. 

Statistics, the science of drawing conclusions and mak- 
ing decisions in the face of uncertainty, aids in achieving 
many of the objectives for specifications. Based on produc- 
tion pilot units, statistical methods are used to estimate 
the expected performance of future production units. En- 
vironmental effects can be modeled so that not all units 
have to be tested overall environments. Specific graphical 
tools provide a clear picture of performance to facilitate 
the decision process by marketingand manufacturing when 
setting the final specifications. Statistics is the key to under- 
standing and efficiently quantifying all sources of variation. 

Statistical Model 

The underlying model for the specifications of many 
Hewlett-Packard products is illustrated in Fig. 1. In this 
diagram, production margin is the difference between the 
average product performance and the test line limit (TLL), 
which is the pass/fail limit used by the production line at 
final test under standard environmental conditions. Delta 
environmental (DE) represents the possible change in per- 
formance of the product over the environmental extremes 
specified on the data sheet (e.g., temperature and humid- 
ity), and drift represents the change in performance over 
the calibration period or other specified time interval. Mea- 
surement uncertainty (MU) accounts for possible measure- 
ment errors in the equipment used to characterize the prod- 
uct. The customer guardband represents any additional 

"This structure was originally proposed lo the National Conference ot Standards 
Laboratories (U S A ) by Jonn Minck m 1977 
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Fig. 1. Representation of the components Wat make up a 
warranted product specification, TLL is the test line limit used 
by the production line at final test 
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Robust Estimators 



The sample mean and sample standard deviation are the best 
estimators of the location and scale ot a Gaussian or normal 
distribution However, a lew large errors may occur in the data, 
resulting in a distribution with heavier tails than the Gaussian in 
this case, estimators that piace less emphasis on extreme values 
may provide more appropriate estimates for the mean and stan- 
dard deviation 

One such set of estimators consists of the trimmed mean and 
the Winsonzed standard deviation To calculate these estimates, 
trim off g observations from either end of the set of n ordered 
observations x, < x 2 < . < x„. where g is the closest integer 
to (0 1)n Then the 10% trimmed mean is given by: 

x, = (Xg,, + x g .* + ... + x n ^ g ) / (n-2g), 

and the Wmsorized standard deviation is calculated as 

s, = {[(g + 1 Hx,,*, - x,) z + (Xg.,2 - x,) ? + ... 

+ (x n _ g _,-x,) 2 + (g + 1)(x n . a - x,) 2 ]/(n-2g-1)}" J . 

The Winsonzed standard deviation formula treats the large (or 
small) observations that were trimmed as if they were equal to 
the largest ( or smallest) observations in the trimmed sample. 

When the trimmed mean and Winsonzed standard deviation 
are used, the effective sample size is the size of the trimmed 
sample Hence, when finding the K factors for use in x, + Ks,. 
enter the tables with n = n - 2g. 

Other robust estimators, such as the biweight, are described 
in References 1 , 2. and 3 listed on page 1 1 



guardband considered necessary to ensure that in a worst- 
case scenario, any customer's unit will perform better than 
the published specification (this additional guardband is 
sometimes set to zero). The right end of Fig. 1 (SPEC) rep- 
resents the warranted customer specification as described 
by the product data sheet. 

When finalizing product specifications, it is necessary 
to look at Fig. 1 from two directions: from the SPEC inward 
and from the observed pilot data outward. The former di- 
rection involves setting internal criteria (the TLLs) to 
guarantee that the product performance meets specifica- 
tions over all warranted conditions. The latter direction 
involves setting specifications that will allow a high yield 
in manufacturing and simultaneously minimize the chance 
that good units will be rejected and maximize our confi- 
dence that the units passed will perform as specified. We 
shall focus on the statistical aspects of moving from the 
production pilot units outwards to the preliminary specifi- 
cations. However, it is important to realize that the final 
specifications are set by iterating between both approaches 
to ensure that the final product has a high yield and satisfies 
the market needs. 

Using our experience of specific product introductions, 
we will describe the relevant statistical methods needed 
to complete the estimation of each of the quantities in Fig. 
1 based on production pilot data. The same approach is 
also used to adjust specifications on more mature products. 

Estimating the Test Line Limit 

Based on the assumption that the production pilot units 
are representative of the product characteristics that can 
be expected in production, the goal is to use these pilot 
units to calculate the TLL so that the vast majority of pro- 
duction units will be better than this limit. In particular, 
we need to ensure that the products will be readily man- 
ufacturable over the long term at a low cost. 

If we use the convenient assumption that tin? production 
pilots come from a Caussian (normal) distribution with a 
population mean of /x and standard deviation of tr, then the 
interval (/i - 2a, ft + 2a) contains 95% of the distribution 
and [fi - 3a. n + 3rj) contains more than 99% of the distri- 
bution. This normality assumption allows us to model the 
expected production pilot distribution and the correspond- 
ing percentiles using only the two parameters n and a. 



- 




8> o.o 1 1 1 1 1 1 1 1 

-4-3-2-10 1 2 3 4 

Expected Value from Gaussian Distribution 

Fig. 2. Ouanlile-quanlile plot ol spurious distortion data on 
a voltage scale. 



While there are good reasons to assume normality, it is 
important to check the validity of this assumption before 
proceeding to estimate the percentiles of the production 
pilot distribution. Histograms are frequently used to verify 
the symmetry of a distribution. However, the investigation 
of normality is much clearer when the data is plotted on 
a Caussian quantile-quantile pint as illustrated in Figs. 2 
and 3. 

Here the data values are plotted against the values that 
would be expected from a Caussian distribution. If the data 




« -75 1 1 . 1 1 1 1 1 

-4-3-2-1 0 1 2 3 4 

Expected Value trom Gaussian Distribution 

Fig. 3. Ouantile-quantiie plot ol spurious distortion data on 
a dBc scale. 
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is Gaussian, the values should lie approximately on a 
straight line. Such plots as these help to check assumptions, 
to determine the appropriate scale in which to set specifi- 
cations, and to detect measurements or units that appear 
not to conform to the bulk of the data. To illustrate the 
appropriate choice of scale, Figs. 2 and 3 represent data 
on spurious distortion for an HP Stanford Park Division 
product on both the linear (voltage) and the log (dBc) scale, 
respectively. Fig. 3 shows that the measurements on the 
log scale are more nearly Gaussian than those on the linear 
scale in Fig. 2. Consequently, the calculations for determin- 
ing the TLL of this product should be done in dBc, 

A different situation occurs in Fig. 4, which represents 
a quantile-quantile plot of intermodulation distortion from 
some pilot-run models of the same product. While the bulk 
of the data appears to be nearly Gaussian, one instrument 
(circled) appears to be different, exhibiting extremely good 
performance. This outlying instrument should be ac- 
counted for in the estimation of the mean and standard 
deviation used to calculate the TLL. The following section 
outlines the methodology for dealing with such points. 

Estimating /< and a 

Consider now replacing the mean ii and the standard 
deviation a with their standard estimates 



x = x. 



x n )/n 



and 



s = {[(x,-*) 2 + (x 2 -x) 2 + ... + [x n -xf]/{n-D} V2 . 

respectively, where x, represents the measurement value 

from pilot unit i (i = 1, 2 n) and n is the total sample 

size. We are faced with two potential problems. First, con- 
sider the situation where one or two extreme points are 
found in the sample, such as in Fig. 4 for the intermodula- 
tion data. If these points occur only at the opposite extreme 
from where the TLL is to be set, then under certain cir- 
cumstances we may wish to use estimates of it and o- that 
are less sensitive to (i.e., more robust against) individual 
extreme departures from a Gaussian distribution. The stan- 
dard estimates x and s given above are notoriously non- 
robust. An example of how one extreme point can throw 
off x and s is illustrated in Fig. 5 below. Here both sets of 

-58 -r 
CD -60 " 

- • • 
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SO -66- 
£ ? 

°S -68- 
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%% -70- 
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I -72- 
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-1.5 -1.0 -0.5 -0.0 0.5 1.0 
Expected Value from Gaussian Distribution 



1.5 



Fig. 4. Quantile-quantite plot of two-tone intermodulation dis- 
tortion data, showing an extreme point. 



Observation Set 1 
x = 5.0 s = 3.3 n = 11 



Observation Set 2 
x = 5.0 s = 3.3 n = 1 1 



0 

(a) 



10 



15 



0 

(b) 



10 



15 



Fig. 5. Both sets of observations have the same values lor 
x and s, but the values for (b) are inflated because of the 
extreme point at 15 

observations have the same values for x and s. However, 
clearly, x and s are inflated for the set of observations in 
Fig. 5(b) because of the one extreme point at 15. 

There are a number of procedures for robust estima- 
tion. 12,3 For example, the standard estimates x and s for 
the intermodulation data in Fig. 4 are - 63.53 dBc and 4.59 
dBc, respectively, whereas the more robust 10% trimmed 
mean and Winsorized standard deviation (see "Robust Es- 
timators." page 7) for this data are -62.36 dBc and 1.64 
dBc. respectively. When such methods are used, the en- 
gineer must be certain that the extreme points are well 
understood, and could not equally well have occurred on 
the other extreme of the distribution. Unless the mecha- 
nisms causing such extreme observations are understood, 
robust estimators should be used with extreme caution in 
this context. 

The second potential problem in replacing fi and a by 
their estimates is that probability statements such as "the 
interval (ii - 2a. fi + 2a) contains 95% of the distribution" 
are no longer true in general. The amount of the underlying 
distribution covered by the interval (x - 2s, x + 2s) or 
(x - 3s, x + 3s) depends on how accurately x and s 
estimate fi and <r, which in turn depends on the sample 
size n. For large samples, the statement "the interval (x - 2s. 
x + 2s) contains 95% of the distribution" will be approx- 
imately true (as a result of the Central Limit Theorem). But 
how large a sample size is needed to achieve reasonable 
accuracy? 

This question is answered by reference to suitable toler- 
ance limit tables. 4,5 Given the sample size, these tables 
provide a level of confidence that the interval does indeed 
cover the requested percentage of the distribution. For 
example, based on a sample of 10 units, if we want an 
interval that covers 95% of the distribution with 95% con- 
fidence, we would have to use (x — 3.4s. x + 3.4s) rather 
than (x — 2s, x + 2s). This tolerance interval procedure 
accounts for the random variations in the estimates of both 
^ and it, and provides an interval that will include at least 
a specified percentage of the population with a high degree 
of confidence. 

In conclusion, our procedure for setting the test line 
limits involves three steps. First, verify the distributional 
assumption of a sample from a Gaussian distribution. Sec- 
ond, estimate the mean and standard deviation of the per- 
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Propagation of Error with 
Multiple Sources of Variability 

For a single source of variability the standard deviation a can 
be estimated by 

s = |[(x,-x? + + ♦ (Xn-xfl/tn-l)}'*. 

where thex,(i = 1.2, n) represent measurements on n units 

However, it frequently happens that the test characteristic is 
measured as a known function of other independent measured 
variables For example, suppose that y, . y 2 , and y 3 are the heights 
of ihree types of discs If one stacks the discs on top of each 
other, then the height of the stack is h = y, + y 2 + y 3 . Given 
tolerance limits on the individual discs, what are the tolerance 
limits on the overall stack height' Define s,, S;.. and S 3 to be the 
standard deviations of y.. y 2 . and y 3 . based on n,, n 2 . and n 3 
data points, respectively Then the law of propagation of errors 
states that the standard deviation of h is given by 

s h = Vs? + s| + s|. 

Note that this formula requires only that the measured variables 
y,. y 2 . and y a are independent of one another Further distribu- 
tional assumptions are not necessary. 

Tolerance intervals can now be constructed for h according 
lo the methods discussed in this paper, using an effective sample 
size ol n = n, + n 2 + n 3 - (3 - 1 ) 



formance using a standard or robust procedure. Finally, 
using the K factors for tolerance intervals (based on the 
production pilot run sample size), set TLLs at x + Ks or 
x - Ks as appropriate. These limits will provide the first 
step in setting customer specifications and provide knowl- 
edge, with high confidence, of what can be expected from 
future manufacturing, assuming a stable manufacturing 

process. 

Estimating Delta Environmental 

To guarantee that the product will meet or exceed its 
warranted performance specifications over all environmen- 
tal conditions, it is necessary to estimate the expected 
change in product performance from the standard test con- 
ditions to the worst-case environment. In most cases, this 
can be estimated as the average measurement in the worst- 
case environment minus the average measurement in the 
standard (or test) environment, that is. 

DE = x onv - x sl( j. 

Some products will experience not only a mean shift 
over an environmental range, but also an increase in the 
variability of their performance. This increase in variability 
can be accounted for by the following equation: 

DE = x miv - x, ld + K(s„ nv - s., d ). 

Pig. 6 illustrates both situations by depicting the distribu- 
tion of a population of units in the standard operating 



environment and in the worst-case environment. If s nnv = 
s nd , as in Fig. 6a. then DE will be equal to x env - x„ d . If 
s env ' s greater than s 5lll (Fig. 6b). then DE will be equal to 
the difference in some percentile. By accounting for DE. 
customers can be assured that the units tested to the TLL 
in the standard environment will meet or exceed specifica- 
tions over the warranted conditions. 

Estimating Measurement Uncertainty 

Measurement uncertainty accounts for possible changes 
in measurements caused by the inherent variation of the 
test or measurement equipment. The most common method 
of estimating this variation is to combine the specifications 
provided on the test equipment data sheets for each piece 
of equipment integrated into the test system. For example, 
if there are four pieces of equipment making up the mea- 
surement system, with individual specifications ±r,, ±r 2 . 
— r 3 . ±r 4 . then typically we can estimate the respective 
standard deviations s,. s 2 , s 3 . and s 4 as one half or one 
third of these values (if they are given as 95% or 99% limits, 
respectively). Using the root-sum-square (RSS) principle 
(see "Propagation of Error with Multiple Sources of Vari- 
ability," this page), the measurement uncertainty for the 
system is calculated as 

MU = V (s, 2 + sf + s, 2 + s 4 2 ). 

Setting Preliminary Specifications 

To account for all the sources of variation, the prelimi- 
nary specifications relative to manufacturing capabilities 
would be defined as 



SPEC = x + V (Ks) 2 + (3MU) 2 + DE. 

or 

SPEC = x - V (Ks) 2 + (3MU) 2 + DE. 




Fig. 6. Estimation ot delta environmental la) when the mean 
is a tunction of the environment but the standard deviation is 
constant, and (b) when both the mean and the standard de- 
viation are tunctions ot the environment 
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Fig. 7. Boxplols of common- 
mode rejection data for different 
instruments and channel numbers 
Serial Number.Channei over alt environmental conditions 



226.1 227.1 230.1 247.1 248.1 249.1 250.1 254.1 255.1 256.1 

226.2 227.2 230.2 247.2 248.2 249.2 250.2 254.2 255.2 256.2 



as appropriate. Again we use the RSS principle to combine 
the unit variation (s 2 ) and the measurement uncertainty 
(MU 2 ). Delta environmental is treated as a bias. This is the 
preliminary SPEC based on evidence of manufacturing ca- 
pabilities, but these manufacturing capabilities do not cap- 
ture all the considerations in choosing the final specifica- 
tions. Much of the final decision must be based on cus- 
tomer, competition, and business considerations. 

Graphical Tools 

To facilitate discussion and understanding of the pre- 
liminary specifications, graphical displays of the pilot data 
with TLLs and preliminary specifications are useful. In 
particular, boxplots (see "Boxplots," page 11) are impor- 
tant tools for displaying the magnitude and dispersion of a 
number of sets of data. For the HP 5183 A Waveform Record- 
er, the dispersion of measurements for each pilot run instru- 
ment over the various environmental conditions is shown 
in Fig. 7, and the aggregate pilot run data for each of the 
tested environments is shown in Fig. 8. Fig. 7 allowed the 
group responsible for setting specifications to identify the 
particular channels that performed exceptionally well (e.g., 
#250.1) or showed little variation over temperature and 
line voltage (e.g., #247.2). Fig. 8 indicates that there is little 
difference between environments. These graphical tools 
help identify cases that are different from the rest, a neces- 
sary step in improving the consistency of products and the 
manufacturing process. 

Boxplots can be used to evaluate specifications over a 
range of measurements. For example. Fig. 9 is a plot of the 
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Fig. 8. Boxplots of common-mode rejection data for different 
environments for all instruments. 



pilot run results of the same measurement on the various 
HP 5183A Waveform Recorder ranges. This kind of graphi- 
cal display helps determine the ranges where the specifica- 
tions should change. If there are any ranges that are particu- 
larly variable, they can be pinpointed for more intensive 
testing, providing a basis upon which to optimize the effi- 
ciency of the internal testing process. 

Once preliminary specifications are established, many 
of Ihe specifications will be as expected and will require 
little if any discussion, while other specifications may re- 
quire some specific action. A systematic approach to cal- 
culating the preliminary specifications is important. This 
includes validation of assumptions, a clear understanding 
of the sources of variation, and graphical displays of all 
the pertinent information. This approach will do much to 
ensure that the decisions about final specifications result 
in a valid representation of a product's performance upon 
which customers can depend. 
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Boxplots 



Boxplois are a graphical tool displaying the salient teaiures of 
the distribution of a set ot data- namely ihe location , the spread 
any asymmetry, ano outlying data points ' Because of their com- 
pactness, ooxplots are especially useful lor comparing several 
sets of data on one graph 

Fig 1 shows a histogram and a boxpiot of a single set of data 
in this boxpiot. the maximum value is not joined to the box by 
the "whisker (i e the line extending from the box), and is iden- 
tified separately as an unusual value II we define the difference 
between the 75th percentile and the 25th percentile as the inter- 
quartile range (IQR). then unusual values are those values that 
are more than 1 5 IQR away (rom the appropriate percentile. In 
Ihe case of an unusual value, the whisker will end at the largest 
(smallest) data value that is within 1 5 IQR ol the 75th (25th) 
percentile. 
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Circuit Design Using Statistical 
Data Analysis 

A methodology for setting limits of uncertainty on amplifier 
output power based on measured prototype data facilitates 
the design of amplifiers that minimize output power 
variations. The result is a printed circuit board requiring no 
adjustments. 

by Karen Kafadar and Lynn M. Plouse 



A COMMON SITUATION facing the circuit designer 
is the selection of design parameters for certain cir- 
cuits so that specified targets and variations in out- 
put power levels are met, This is the case for several circuit 
boards that are part of the HP 86792A Agile Upconverter, 
a fast-switching synthesizer that can upconvert complex 
modulation with up to 32-MHz bandwidth to anywhere in 
the 10-to-3uf)()-MHz range. These boards contain several 
circuits such as amplifiers, pads, frequency dividers, power 
splitters, switches, and filters. While some circuits, such 
as power splitters and dividers, involve few design choices, 
other circuits, such as amplifiers and pads, involve param- 
eters that can be chosen in a variety of ways to meet target 
output levels. The aim is to select values for those param- 
eters so that variation of the output power is minimized. 
If the overall variation in the output power is small enough, 
no adjustments are necessary, thereby reducing cost and 
labor requirements in production. 

Statistical methodology has been applied to this project 
in several ways. Measurements on the different types of 
circuits provide information on output levels and expected 
variation. Such data, collected in the prototype phase, is 
used to select design parameters for amplifiers so variation 
of the output power can be minimized. Variations caused 
by other sources, such as temperature and input power 
variations, are assessed. Finally, the results from the lab 
pilot phase are compared with those from the prototype 
phase for prediction of performance in production. 

This paper discusses the application of statistical 
methodology to seven printed circuit boards used in the 
prototype design phase. This project illustrates the power 
of statistics and the value of a statistical analysis of data 
collected early in the design stage. The success of Ibis 
approach is verified by the close prediction of lab pilot 
units from the prototype, and. more important, by the labor 
savings from designing a printed circuit board requiring 
no adjustments. 

Circuit Layout 

The new synthesizer upconverts a modulated input sig- 
nal. Several fast switches allow fast hopping of the carrier. 
Since each switch drives a mixer, its output level is critical. 
If the output power is too high, the mixer's spurious outputs 
(spurs) increase, degrading the overall spur performance 



of the instrument. On the other hand, if the switch output 
power is too low. the signal-to-noise ratio gets worse. Op- 
timally, the output levels of the switches should be within 
a 3-dB window, regardless of output frequency, tempera- 
ture, or components used to assemble the circuit. 

Often the target output levels on circuit boards of this 
type are achieved by adjustments for each pole of the 
switch. Such a procedure requires at least 16 adjustments 
for these particular boards, depending upon how they were 
implemented. At a minimum rate of $12 per adjustment, 
this is highly costly in production. A better way uses statis- 
tical analysis to determine the loss and variation of the 
circuits in the switch modules. In particular, some of these 
circuits (e.g.. the amplifiers that drive the circuits) can be 
designed to minimize output variation caused by variation 
of the incoming power. 

Sources of Variation 

Having defined the goals of the circuitry, data on various 
circuits must be collected and analyzed to help realize 
these goals. Depending upon availability of components 
and required test time, different circuits of each type are 
measured. This data provides estimates of output power 
levels and variations as a function of input power. Several 
sources of variation in these measurements can be identi- 
fied: 

n Variations in incoming power levels 

«• Errors in input power level transmitted by the circuit 

o Temperature biases on the output 

■ Circuit component variation 

■ Variation in the manufacturing of the circuit board ma- 
terial 

• Measurement error. 

In this study, the incoming power is actually the output 
power from an earlier portion of the instrument, which 
itself has a prespecified variation by design. The transmis- 
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Fig. 1. A typical power path on one board at a single fre- 
quency (470 MHz) 
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sion of the error in this incoming power depends upon the 
circuit (see next section). 

All circuits are required to perform over a temperature 
range to pass environmental testing: hence the offsets in 
power levels because of extreme heat |70°C( and cold (5°C) 
are estimated (see ' Temperature Testing." page 14). 

Having assembled all circuits on printed circuit boards, 
variations in lots of hoard material or components may 
affect output power: again, such variation is best estimated 
by taking measurements on different boards (see "Compo- 
nent Variation," page 15). 

Finally, this analysis involves two components of vari- 
ation that come from measurement error. The first arises 
directly from the test system and is assessed most typically 
from customer specifications. The second component in- 
volves an error imposed by the analysis, because input 
power levels are not identical for measurements of circuits 
taken on different occasions (different temperatures, com- 
ponents, etc.). To make fair comparisons of the output 
levels, most data had to be linearly interpolated to a com- 
mon input power scale. This common scale was an interval 
within the span of the data confined to the integers (e.g.. 

- 10 dBm, -9 dBm 10 dBm). The maximum error in 

linear interpolation is given by one eighth of the second 
derivative of the function and estimated by one fourth of 
the second divided difference. Only rarely did this maxi- 
mum error exceed 0.02 dB. Generally, this error was a 
negligible component of the overall uncertainty in the 
power level. However, in other less precise measuring situ- 
ations, this error might not be negligible. 

The variations from these different factors are estimated 
and combined into an overall standard deviation by a root 
sum of squares (RSS) method. This method is almost always 
justified by the fact that the variance (square of the standard 
deviation) of a sum of independent random quantities 
equals the sum of the variances, regardless of how the 
individual random quantities are distributed. In some cases 
where two factors were measured simultaneously (e.g.. 
temperature and boards), the interaction between them was 
negligible and was therefore ignored. 

For simplicity , and in accordance with recommendations 
by the Bureau International des Poids et Mesures.' all vari- 
ations are given in terms of plus or minus one standard 
deviation (SD or a), except where noted. When a Gaussian 
(Utopian) model for the data applies, 2.576 multiples of 
the population SD should include 99% of the population. 
When only ten units have been measured and the Gaussian 
model is assumed. 3.018 multiples ol the estimated SD 
includes 95% of the population with 90% confidence. 2 A 
rough translation of a customer specification of a test sys- 
tem into a la uncertainty is mostly guesswork; often one 
half to one third of the customer specification is treated as 
one standard deviation. Customer specification of the 
power meters in this study is 0.2 dB; hence a MV ~ 0.07 
dB. (Throughout this paper, it will refer to an estimate of 
the variation from a specific source, which will be iden- 
tified by the subscript on ir.] 

Transmission of Error 

In this section, we illustrate the statistical methodology 
by considering one path from one board corresponding to 



one frequency input and output: this path is shown in Fig. 
1 Power is transferred along this path through a series of 
circuits that includes a power splitter, an amplifier, a 
switch, and some pads to absorb excess power. The input 
puwer level from the preceding circuitry for this path is 
estimated at 3.91 dBm ±0.94 dB. The power splitter incurs 
a loss of 3.8 dBm ±0.16 dB. Hence, using RSS. the power 
preceding the amplifier is at most 0.11 dBm i0.95 dB. 
Increasing the loss of the amplifier input pad will lower 
the output power but not change the variation of the trans- 
mitted power. The goal is to design the amplifier so that 
the target output power after the switch is - 6,5 dBm. with 
a 99% uncertainty interval of ±1.5 dB, 

Knowledge of the transmission of error variation through 
a circuit provides the greatest leverage in reducing overall 
variation in the output power of the switch. Consider a 
typical response curve from a switch in Fig. 2. It is clear 
that the output variation transmitted around P„ is larger 
than that around P,. To a first approximation, this output 
variation depends upon the slope ol the curve at the input. 
Taylor's theorem provides the basis for this "law of prop- 
agation of error": 3 

output = Rp) - f(P„) + (P - P () )f(P (1 ). 

= constant + (P — constant) x constant. 

Since 

SD (constant + X) = SD(X) 

SD (constant x X) = constant x SD(X) 

we have that 

SD(output) • If'IPJlxSDIinput) (1) 




P o ±0 66 dB 

Power Into Switch (dBm) al 4 70 MHz 



P,=0.66 10 
dB 



Fig. 2. Typical switch response curve at 470 MHz. showing 
input power at P 0 ± 0 66 dfl to achieve target output ol -6.5 
±0.5 dBm 
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Thus, output variation is related to the input variation 
through the slope of the function relating the output lo the 
input near the point of input power. One may question the 
accuracy of this variation when we use only the leading 
term in the Taylor series expansion. Examples indicate that 
such approximations are in fact quite satisfactory.'' Further 
uses of this law of propagation of error will he demonstrated 
later in connection with combining several sources of vari- 
ation into a single uncertainty statement. 

To meet other specifications, the switch must operate 
where its response is linear. Using equation 1. and given 
the maximum allowable standard deviation of the output 
and the slope of the curve of the designed switch, one can 
calculate the maximum allowable variation in input power 
to the switch. To illustrate, the switch in Fig. 2 has a target 
output of -6.5 dBm and an allowable variation of ±1.5 
dB around this target. The allowable overall (RSS) standard 
deviation of the output power is therefore 0.6 dB, since 2.5 
multiples of this standard deviation equals the desired 
±1.5-dB window with 99% confidence. If the output power 
variation from components is 0.3 dB, then the allowable 
output variation from transmission of error by the switch 
is 0.5 dB, since 0.3 dB and 0.5 dB yield an RSS result of 
approximately 0.6 dB. The slope of the switch function at 
the input level (denoted P„ in Fig. 2) corresponding to an 
output of — 6.5 dBm is 0.76. Thus, the maximum allowable 
variation in input power to the switch is 0.5/0.76 = 0.66 
dB around a target input power of P„ = 4.69 dBm. 

Fig. 3 shows a family of amplifier curves designed with 
three values of collector current. Notice that the effect of 
the collector current is to vary the point at which the slope 
of the curve starts to decrease, that is, the point of amplifier 
compression. The price for this smaller slope (and hence 
smaller output variation) is less output power. Recall that 
sufficient power out of the amplifier is necessary to drive 
the switch and obtain the target — 6.5 dBm out of the switch. 
(If the amplifier output exceeds 4.69 dBm, the loss of the 
pad before the switch can be increased to absorb the excess 
power.) In addition, we seek those curves whose variation 
at the output of 4.69 dBm or above does not exceed 0.66 
dB. Fig. 3 shows that the slope, and hence the output vari- 
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atioil, is smaller at higher input power levels (amplifier 
compression), but recall that, on average, only 0.11 dBm 
of power is available at the input to the amplifier. 

With a maximum limit of 0,11 dBm of input power, we 
calculate the slopes of the curves corresponding to the 
three amplifiers in Fig. 3. These slopes are estimated by 
successive first differences and are shown in Fig. 4. Notice 
that the slopes do not drop below the maximum allowable 
slope of 0.66 dB until the input power reaches at least 
-0.76 dBm, 1.7 dBm, or 3.7 dBm, respectively. Only the 
15-mA curve satisfies both the incoming power constraint 
and the maximum slope constraint. 

The allowable interval ( - 0.76 to 0.11 dBm) is indicated 
in Fig. 3. Since the slope is smallest at 0.11 dBm. one might 
reduce the output power of 7.31 dBm after the amplifier 
with a 2.62-dB pad. For other engineering reasons, a 1-dB 
pad is placed before the amplifier. This increases the slope 
only slightly to 0.70. The output of the amplifier at P hl = 
- 0.89 dBm is 6.62 dBm, so a 2-dB loss on the pad preceding 
the switch ensures the target -6.5 dBm at the switch out- 
put. 

This same analysis is applied to all circuits in the pro- 
totype stage of the design. An additional consideration is 
the effect of temperature on the circuits. This effect is dis- 
cussed in the next section. 

Temperature Testing 

Circuits must perform under a variety of environmental 
conditions. One such set of conditions is extreme temper- 
atures. Measuring at least two circuits of each type at 5°C 
(cold), 25°C (room), and 70°C (hot) allows an estimate of 
the effect of temperature on the circuits. This effect on 
some circuits is minimal; forexample, forthe power splitter 
in Fig. 1, the change in power is only ±0.05 dB, depending 
on whether the circuit is hot ( - ) or cold ( + ). Likewise, 
amplifiers are largely unaffected by temperature. Fig. 5 
shows a set of six curves: two amplifiers, each measured 
at room. hot. and cold temperatures. The curves are nearly 
identical. Heat decreases the output power by an average 
of only 0.26 dB, and cold increases the power by an average 
of only 0,33 dB. (The temperature biases are nearly zero 
when the amplifiers are in compression.) Regardless of 
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Fig. 3. Amplifier response curves for three values of collector 
current at 470 MHz. 
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Power into Amplifier IdBm) 

Fig. 5. Six amplifier response curves measured at 5°C. 25°C, 
and 70°C lor two amplifiers 

output frequency, an overall lei uncertainty of 0.27 dB 
accounts for %'ariation in both temperature and amplifier 
components. 

The temperature effect is not negligible for switches. Fig. 
6 shows the difference in power between room and hot 
temperatures and between room and cold temperatures for 
two switch assemblies. Notice that this effect is not constant 
for all switches. In the region of interest (P in = 0to5dBm). 
the power at 25°C is roughly 0.8 dB higher than at 70°C for 
one switch and 0.96 dB higher for another switch. Simi- 
larly, one switch is 0.4 dB lower at 25°C than at 5°C, while 
a second switch is 0.3 dB lower at 25°C. The temperature 
biases for the two switch assemblies are averaged: AHot = 
-0.88 dB and ACold = 0.35 dB. The variation in these 
averages between different switch assemblies constitutes 
part of the component variation, which is discussed in the 
following section. 



Component Variation 

Component variation can be assessed in two ways: man- 
ufacturer specification or actual measurement data. The 
first approach requires a translation of the manufacturer 
specification into multiples of <r. Usually the best guess is 
to treat the specification as a 3cr limit. In this study, this 
approach was limited mostly to characterizing the variation 
of purchased low-pass and bandpass filters. More often, at 
least two circuits of each type (power splitters, printed 
circuit board filters, amplifiers, dividers, switches. XOR 
gates) were measured to provide more realistic determina- 
tions of the uncertainties in the components. Such uncer- 
tainty necessarily includes measurement error. 

We have seen the effects of component variation both in 
the amplifiers and in assessing the temperature biases on 
the switches. The component variation, like the tempera- 
ture effect, is small for the amplifiers: 1<t estimate of the 
combined variation is 0.27 dB. The component variation 
in the switch circuits is higher, as Fig. 7 illustrates. Here, 
only a portion of the switch curve is plotted to exaggerate 
the differences in the curves resulting from measuring two 
different circuits on two different boards. The lower curves, 
representing measurements of two circuits on hoard A. are 
clearly closer to each other than they are to either of the 
upper curves, representing measurements of two circuits 
on board B. The hoard variation is substantial; here, the 
difference is approximately 0.7 dB. This variation may be 
caused by various other components on the boards, such 
as the low-pass filter that follows the switch. On a given 
board, measured differences between circuits are smaller. 

A difference of two measurements can be translated into 
a la estimate by dividing the difference by . (In general, 
for Gaussian data, an estimate of a from the difference 
between the largest and smallest of n observations is ob- 
tained by dividing this difference by Vii. when n<10. See 
page 317 of reference 4.| This additional variation is in- 
cluded in the overall uncertainty statement, as we discuss 
in the following section. 
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Fig. 6. The upper curves show 
room-temperature (25°C) output 
power relative to hot-temperature 
(7CCC) output power as a function 
ol input power for two switches, a 
and b The lower curves show 
room-temperature output power 
relative to cold-temperature (5°C) 
output power lor the same two 
switches. 
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Fig. 7. Response curves lor two switches on board A (lower 
curves) and two switches on board B (upper curves) 

Combining Uncertainties 

The root sum of squares |RSS) rule and the Taylor series 
rule are special cases of the general law of the proposition 
of error, which can be used in a general way to combine 
estimates of variation into a single overall uncertainly state- 
ment. The general law of propagation of error says that, for 
a function of k independent variables 

y = f (x,. Xj x k ) 

the variance (square of the standard deviation) of y is ap- 
proximated by 

Var(y) = £ (af/6 , x,) 2 Var(x 1 ) 

where the partial derivatives are evaluated at the sample 
means of the k variables (see page 363 of reference 5). 
Notice that when k = 1, taking the square root of both 
sides leads to equation 1 above. In this paper, the response, 
output power, involves variations in input power, compo- 
nents, board version, and measurement error. Temperature 
effects are treated as additive biases. The simplest model 



for output power is an additive one: 

output power = A( input power) + component effects 
+ board effects + ME 

where A(input power) is an amplifier function like those 
shown in Figs. 3 and 5 and ME stands for measurement 
error. According to the law of propagation of error, 

Var(output) = A' 2 (Pi„) x Var(input) + Var(components) + 
Var(boards) + Var(ME) 

so it follows that 

SD(P DUI ) = [A' 2 (P in )a 2 n + o- 2 om „ + o-L, rd + 

which is the familiar root sum of squares formula. Notice 
that nowhere have we assumed a specific probability dis- 
tribution for any of these variables, only that the variables 
are independent. 

We are now ready to combine the previously identified 
sources of variation into a single uncertainty statement, 
applying this law after every circuit. We treat the power 
levels at the three different temperatures separately, since 
5°C and 70°C are extreme conditions, and in a given operat- 
ing environment, the temperature is not likely to vary over 
such a broad range. Also, the measurement error <r MK has 
been included in the measurement of component and board 
variation. 

Consider our previous illustration at room temperature, 
for which P in = 3.91 dBm ±0.94 dB. The power splitter 
and the pad incur power losses of -3.8 dB ±0.16 dB and 
1 dB ±0.00 dB respectively. Thus P jn to the amplifier is 
-0.89 dBm ±0.95 dB. The amplifier increases this power 
to 6.62 dBm, where the slope is 0.7, so the output variation 
is reduced to ±0.70 dB. Amplifier component and temper- 
ature variation of 0.27 dB raises this uncertainty to ±0.75 
dB. A 2-dB loss results in P in = 4.62 dBm ±0.75 dB to the 
switch, which itself has a slope of 0.76 at P ln . Thus the 
output power is -6.48 dBm ±0.57 dB. Combining this 
with the board variation (±0.40 dB) by the root sum of 
squares method, we have a final uncertainty of ±0.70 dB. 
A similar path of analysis follows from the power levels 
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Fig. 8. Triangles indicate the ex- 
pected values and the 2 5o- limits 
of the power output from each 
stage for three temperatures: 
room (right, color), hot (left, color), 
and cold (left, black) 
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Fig. 9. Room, hoi. and cold response curves lor an average 
ol lour prototype synthesizer boards .and a room-temperature 
response curve lor a lab pilot board 

at hot and cold temperatures. 

A graphical display of all these numbers is shown in Fig. 
8. The expected power level at each stage is indicated by 
the center of the triangle on the vertical axis corresponding 
to each circuit. There is one triangle for each temperature: 
room on the right, hot and cold on the left. The length of 
the base of the triangle extends to 2.5 multiples of the 
standard deviation of the power output at each stage. Thus, 
considerable information is conveyed in this display- 
changes in power levels across circuits, variations in power 
levels at different stages, and differences in power levels 
at extreme temperatures. The reduction in variation from 
the beginning to the end of the path can be seen, and the 
target P ou , = -6.5 dBm is attained within the desired 
limits at room temperature. 

Comparing Prototype Analysis with Lab Pilot Results 

The path of power shown in Fig. 1 can be formulated 
into a single functional relationship between input power 
to the power splitter and output power from the switch. 
This functional relationship is computed using linear inter- 
polation in the data collected on amplifiers anil switches 



and is shown in Fig. 9. A lab pilot board was measured for 
its power transfer between the initial and final points of 
the circuitry: it is shown in Fig. 9 for room temperature 
only. The agreement in the general shape of the curves 
between observed lab pilot measurements and expected 
values based on the prototype is excellent. Any absolute 
difference between the curves can be explained by imped- 
ance mismatches: pads can be adjusted accordingly. Once 
these pad values are ascertained for the lab pilot, target 
power levels are achieved within the designed limits, with- 
out the need for further adjustments of pads in production. 

More Complicated Analyses 

Not all power paths in the instrument are as straightfor- 
ward as the one illustrated in this paper. Fig. 10 shows a 
display similar to Fig. 8 of the uncertainties in a power 
transfer involving incoming power at one of two different 
frequencies. While the approach to amplifier design and 
estimation of component variation is similar, the estima- 
tion of variation resulting from input power is complicated 
by the presence of incoming power at another frequency. 
Thus, measurements must be taken by varying power levels 
at both frequencies. Other factors, such as board type and 
circuit components, can also be varied, and a statistically 
designed plan to take measurements on several different 
factors simulaneously is encouraged. 5 
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Statistical Calibration of a Vector 
Demodulator 

Circuit performance characteristics must be determined 
and accounted (or in instrument calibration. This paper 
describes an algorithm for statistical demodulator 
calibration to guarantee high precision in the demodulated 
signal. 

by Karen Kafadar 



VECTOR MODULATION INSTRUMENTS such as 
the HP 8780A Vector Signal Generator' and the HP 
8980A Vector Analyzer' provide a test system for 
measurement applications in such fields as digital micro- 
wave radio, communications, and radar. Some of the 
technologically new developments in this series of prod- 
ucts are the modulation capabilities (complex, wideband 
signals), the high-resolution CRT display, and the 350-MHz 
bandwidth. 

The precision measurement capabilities offered in these 
products raise the issue of the internal calibration of the 
instruments, All instruments need to be calibrated to ac- 
count for differences in components and test conditions. 
The accuracy and precision of these internal self-adjust- 
ments are crucial, since they have direct effects on the 
characterization of the device under test. The hardware 
measurements made for purposes of internal calibration 
must be combined in a logical and intelligent fashion to 
yield the correct calibration factors. 

Statistical methods play an important role in the calibra- 
tion routine. This paper describes a statistical approach to 
the calibration of the HP 8981A Vector Modulation Ana- 
lyzer. The next section briefly introduces the use of statis- 
tical methods in such situations. The measurement process 
for a vector demodulator is then described, using the HP 
8981 A as an illustration. Two sections provide the techni- 
cal details of the calibration algorithm both with and with- 
out an accurate vector signal generator (e.g.. the HP 8780A) 
as a reference. Finally, examples of real data illustrate the 
computations in the statistical algorithm and formulas for 
the uncertainties in the calibration factors. It will be shown 
thai the algorithm can be used to calibrate not only the de- 
modulator in the HP 8981 A but also external demodulators. 

Statistics in Calibration Routines 

Calibration consists of comparing a set of measurements 
from an uncharacterized instrument (e.g.. an HP 8981A) 
with a defined reference standard (e.g.. an HP 8780A) ac- 
cording to a measurement algorithm. Thus the calibration 
model relates the observed measurement readings to the 
reference standard. A model is never perfect, and the dif- 
ference between the model and reality can be characterized 
according to two types of errors that may be present: 

Pan ol ttirs paper appeared in MSrV 5 CommuntcaVons Technology. Ma/ch 1988 Reprmied 
bi permission ol Ihe publisher Copyrighi EW Communications Inc 



n Systematic errors resulting from incomplete specifica- 
tion of the calibration model 

Random errors, or small, unpredictable fluctuations that 
affect everv measurement but are themselves unmeasure- 
able. 

Statistical methods can be used to address both types of 
errors. First, the calibration should be designed to allow 
identification of possible systematic departures between 
the model for measurement and the observed data. Second, 
the influence of random errors can be assessed when es- 
timating the parameters in the model that relate to the 
measurement process and their uncertainties. In this paper, 
we shall assume that the random errors are independent 
and identically distributed according to a symmetric distri- 
bution; that is, that the individual errors cannot be pre- 
dicted in either size or direction, and that the chances of 
an erroneous measurement being loo large or too small are 
roughly equal. (Diagnostic tools for checking the validity 
of these assumptions and the consequences of their viola- 
tion are discussed in connection with the examples later 
in this paper.) 

Rarely is a single estimate of a target quantity sufficient. 
For example, reporting a sample mean without its standard 
error provides no information on the reliability of the data 
that went into that sample mean. The same is true for 
estimates of the parameters describing the measurement 
process. These estimates are the calibration factors, and 
certain limits of fluctuation may be desirable (e.g.. gain 
adjustment accurate within 1%). Therefore, an important 
part of the statistics in a calibration algorithm is the deri- 
vation of associated measures of uncertainty for the calibra- 
tion factors. 

Vector Analyzer Measurement Process 

In the HP 8981A, the phase-modulated signal can be 
expressed mathematically in terms of its frequency compo- 
nents as 

s(t) = £A k (t)cos(iu k t + d^) 

HI 

= |I k |t)cos(u) k t) + Q t (t)sin|io k t) 

Equation 1 expresses the signal in terms of its demod- 
ulated components I k (t) (in-phase) and QJt) (quadrature). 



18 HEWLETT-PACKARD JOURNAL JUNE 1988 



© Copr. 1949-1998 Hewlett-Packard Co. 



Components that vary over time can be observed by examin- 
ing I k |t( and Qjjl) over time at a particular frequency u> k . 
Ideally, at a fixed time, a single-frequency signal with con- 
stant amplitude is demodulated at 

1 = cos|a) 

Q = cosla - (90° - <*)) 

where a is the angle of the vector signal and 6. the error 
in the quadrature phase of the demodulation, is zero. 

Since Ihe HP 8981 A has the ability to display signals in 
either time or vector form, amplitude and phase informa- 
tion must be recovered accurately. This requires special 
software functions to estimate and subsequently account 
for any amplitude or phase offsets. Proper calibration of 
the HP 8981A thus allows simple measurements of 
amplitude and phase modulated signals. 

Fig. 1 is a block diagram for a simplified demodulator, 
such as the HP 8981A. which decomposes the modulation 
on a signal into its in-phase and quadrature components. 
Accuracy and precision in this decomposition are essential 
to assess degradation of the signal's amplitude, phase, or 
frequency, or to assess cross-talk between the two channels. 
Since I and Q signals are used often in measurement sys- 
tems, we will discuss the calibration scheme for the Q-vs-I 
vector representation. 

Six sources for adjustments in a vector demodulator can 
be identified, based on the block diagram of the instrument. 
These six sources correspond to stages in the transforma- 
tion of the ideal 1/Q signal into Ihe measured, or actual, 
I'Q signal. The calibration process uses statistics to estimate 
the sizes and directions of the adjustments attributable to 
these sources and then corrects the output so that the de- 
modulated I/Q signal agrees with the idea] I/Q signal. 

Let (I.Q) denote the pair of points representing the I and 
Q channels al B particular point in time at a given frequency. 
The HP 89H1A hardware transforms Ihe coordinates (I,Q) 
into measured coordinates (x.y) through the following 
series of operations: 



[I.QI — Rotation 
dc Offsets 



Axis Sheer — ► Compression 



x.v 



Rotation can be viewed as a phase change in the ideal 
signal, which can be introduced by any of the components 
before the demodulation. The axis sheer creates two addi- 
tional adjustments: quadrature adjustment, resulting from 
inaccuracy in the 90° split, and gain imbalance, resulting 
from different power levels in amplifiers or unequal power 
losses in other components after the split. The I/Q measure- 
ments are rescaled by amplifier compression, and dc offsets 
may be added to the measurements at any stage. 

Notationally, let us represent these adjustments as: 

■y = gain imbalance (I-channel/Q-channel) 

ili = angular difference between ideal quadrature (90°) 

and actual quadrature 
0 = angle of rotation (sometimes called lock error) 
p = scaling resulting from compression 
I u = dc offset in l-channel 
Q,, = dc offset in Q-chaunel 



The four steps in transforming (l.Q) to (x.y) can be ex- 
pressed mathematically as follows: 

■ Rotation through angle 0: 

I" cosO -sinfll 
1 sinO cosfl J 

This transforms (l.Q) intollcosfl - Qsinfl.lsintf - Qcos»). 

■ Sheer transformation (quadrature adjustment 6. gain im- 
balance -y): 



[ sin<6 cos<6 J 



This causes two additional adjustments: the ratio in the 
two channels (1/Q) is now -y. not 1, and the tangent of 
the angle between the two axes (I.Q) is now (90° - tb). 
not 90°. 

■ The measurements are uniformly compressed according 
to a factor p. 

i The measurements are offset by a fixed amount (I„ in 

the I-channel. Q, in the Q-channel). 

Putting these four effects together, the resultant point 
(x.y) can be expressed as a function of the ideal input (I.Q) 

as: 



7 0 
sine/) cosd> 



cosfl -sintf 
sinW cost? 



[Ml 



(2) 



Thfl unobserved variables e v .e v are assumed to be uncor- 
rected random errors in the measurement of the x and y 
coordinates. That is. 

x = I„ + 7(/x:o.sfl)I - ^(f)sin«)Q + e, 
= ot„ + a, I + a 2 Q + e x 

v = Qo + (pcosflsincp + />sin0cos</>)I 

+ ( - psin(tein</> + /x:osr7cosrfi)Q + e y 
= ft, + /3,I + fcQ t e. 
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Fig. 1 . Simplified vector demodulator 
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where: 



points having phases k(45°). k = 0. 1 7 (Fig. 2). Thus. 



«0 = 


State 


1 


Q 


a, = Ypcosfl 








a 2 = --ypsinW 


0 


1 


0 


Po = Qo 


1 




f 


/3, = pcosfteinrf) + psinflcosrf> 


2 




1 


= (a,sind> - a 2 cosi£)/-y 


3 




f 


^2 = - psintfsin<£ + pcosf?cos</> 


4 




0 


= (a 2 sin<£ + at,cos</>)/-y 


5 




-f 




6 




-1 


The errors of interest can be recovered via the equations: 


7 




-f 



lo = a o (I-channel offset) 
Oh = ft) (Q-channel offset) 
p= [0\ + fg$* (Compression) 

0 = tan~'( -a 2 /a,) (Rotation) (3) 

y = \[a\ + + $)|" 2 (I/Q gain imbalance) 

0 = tan 'Ka,/?] + a 2 /3 2 )/(c«,/3 2 - a,^)| (Quadrature error) 

Notice that, apart from rotation, the ideal 1 and Qcan be 
recovered via the equations 

I = (x - I 0 )/ 7 p 

Q = (7(y - Qu) - sin <b (x - I„)|/7p cos <*>. 

Estimation with Accurate Reference 

At this point, we need to distinguish between the case 
where ideal (I,Q) values are provided and the situation 
where only magnitude information about the (l.Q) points 
is known. Ideal (1,Q) measurements can be obtained using 
a precise signal generator such as the HP 8780A. so that 
signals having several accurately defined phase modes can 
be guaranteed. Since this situation lends itself to the easier 
algorithm, it will be discussed first. 

It is evident that an input signal of constant frequency 
and magnitude and variable phase, namely 



s(t) = Acos(<ut + <£,). t = 0,1. • . . 

results in an I/Q display for which 

I 2 + Q 2 = (Magnitude) 2 . 

which is a circle. It can be verified that the transformed 
points (x,y) lie on an ellipse. The parameters of this ellipse 
are related to the adjustments in the instrument as outlined 
above. 

Since we have six adjustments (parameters), we need a 
minimum of six phase states to obtain an exact fit for the 
parameters in equation 2. 

The 8PSK mode of the HP 8780A allows rapid generation 
of an I/Q signal at eight evenly distributed phase states, 
allowing two extra phase states for model confirmation. 
The coordinates (l.Q) of the points in any one of these 
states will be considered ideal, transformed (by the six 
adjustments) into measured coordinates (x,y). For simplic- 
ity, we label the eight ideal states sequentially at those 



(Numerical association 



where f = 1/V2 = 0.7071068. 
with each state is arbitrary.) 

Suppose n/8 points are measured in each state, so n mea- 
surements in all are taken. (This requirement is not essen- 
tial, but the equations are greatly simplified by encouraging 
orthogonality into the problem.) Denote the coordinates of 

these measurements by (x^y,) (x n .yn)- Associate the 

point with its ideal (I,Q) pair as described above: (Ij .Qi ). 
• • ■ . (I n .Qn)- Then we need to fit a = (an.aj.a;,)' and fi = 
(/3 u ,j3, . fi-,)' in the two relationships: 

x, = a 0 + a,l, + a 2 Qj + e x _ 

Yi = A, + &Ji + ftA + €y, 

The estimates of (a u ,a,.c« 2 ). namely (&,„&,. & 2 ), can be 
obtained by a least squares algorithm. If we let M be the 
matrix of n rows and 3 columns corresponding to the n 
observations and three "carriers" (constant term, l.Q). that is. 



1 o 



0.5 -- 



o.o - - a 



o 
6 



-0.5 - - 



-1.0 



0 



-1 0 



-0.5 0.0 -0.5 

Ideal l-Channel Measurement 



1.0 



Fig. 2. In-phase (I) and quadrature (Q) modulation states tor 
8PSK vector modulation 
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M 



I, Q, 



On _ 



and calculate 



S, - V ie;.'(n - 3) 



then it is well known 3 that 



= (M'M) 'M'x 



(M'MF'M'y 



(4) 



are the least squares estimates of a and fi, where x = 

(x,. x 2 x„)' and y = (y,. y, yj' are column 

vectors and the prime sign (') indicates transpose. Since 
there are n/8 measurements in each state, and the states 
are evenly spaced around the circle, the matrix M is ortho- 
gonal, and so 



M'M = 



0 0 

n/2 0 
0 n/2 



Furthermore. 



M'x 



X|I,. 1 xA 



r n n 

and thus 

a = (M'M) - 'M'x = [ave(x). 2ave (x-I). 2ave(x Q)|' 

0 = (M'MT'M'y = tave(y), 2ave (yl), 2ave(yQ)|'. 

The estimates of the target parameters (I 0 , Qo, p, 0, -y, <t>) 
can then be unscrambled using equations in (3) above. 



e?./(n - 3). 



The divisor is (n - 3) instead of n because three parameters 
have been estimated in each channel. If the departures from 
the model have a Gaussian distribution, we would expect 
most of the e,. to fall within 2s x or 3s x . and likewise for 
e y . This criterion may serve as a simple check for unusual 
measurements (but see "A Caution." page 24). 

Using the root mean squares of the model departures 
(residuals), we can calculate the standard errors of the pa- 
rameters in the model. Again assuming these residuals are 
Gaussian distributed, the standard errors of (d 0 . d,, d 2 ) are 
given by 



Thus, 



s x Vdiagonal of (M'M) 1 

SE(d 0 ) = SE(! U ) = s,/Vn 
SE{0 o ) = SEIQn) = sjVn 



The standard errors for the other target parameters are a 
little more complicated, because they are functions of u,, 
a 2 , /?,, and p 2 - We re 'y on propagation of error formulas' 1 
for approximations to the standard errors of functions of 
variables. The approximations are simplified if 

2s y 2 /(np 2 ) « 1. 

a condition that will often be satisfied if the fit is good (s v 
is small) and the number of points is moderately large. 
Then: 



Standard Errors 

We need some measure of uncertainty in our estimates 
of the parameters. For example, after calculating an esti- 
mate of -y, say y. how far off might that estimate be from 
the true value? One such measure is the standard error 
(SE) of the estimate. An approximate 95% confidence inter- 
val for the estimate is given by 

estimate ±2(SE). 

To calculate the standard errors for the six parameters, 
we first need a measure of goodness of fit in our model. 
This can be calculated most simply as a "root mean square" 
of the differences between the observed measurement x, 
or yj and its fitted value predicted by the model. Denote 
these differences, or residuals, by: 

e s , = X| - d 0 - d,I, - d 2 Qi 

e y , = y, - 0 O - - 0 2 Qi 



0.20 -i- 
> 0.15 

S o.io + 

E 
£ 

a 0.05 -- 

E 

■ 

* 

5 0 -- 
£ -0.05 -- 

2 

-0.15 -- 



-0.20 



-0.20 -0.10 0 0.10 

l-Channel Measurement (x) 



H 1 

0.20 



Fig. 3. Data lor the first example (see text). Six measurements 
were taken in each ol eight states. 
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SE(p) <* s y V27n~ 

SE(-y) = s x \/27n 

SE(tIh0) = s^p-yV^Tn/a? 

SE(tan>) - [ftg + s y )(l + l/^J/nl'^/lpcos 2 ^ 

where tanft and tan(f> are the estimates of tanfl and tandi. 
So approximate 95% confidence intervals for the estimates 
of the errors are: 

y ± mm) 

! 0 ± 2SE(i 0 ) 
Qo ± 2SE(Q,,) 
p_± 2SE(p) _ 
tan '[lan^ 21 2SE(tanfl)| 
tan~'|tin> ± 2SE(tanr/>)|. 

This calibration algorithm thus proceeds as follows: 

Measure (x^yj for i = 1 n 

Associate state (Ij.Qj) for i = 1 n 

Calculate: & = (d a ,6L,,a 2 )' 

Calculate target parameters: y,<t>,p.0.l„,Q„ 
Calculate: goodness of fit (s x .s v ) and residuals (e v ,e v ) 
Identify unusual residuals 
Calculate standard errors. 

Inaccurately Known Phase 

We now consider the situation where phase cannot be 
associated precisely with each measurement. Generally, 
amplitude information is more reliable, that is, 

I z + Q 2 = (I,Q)'{I.Q) = 1. 

Two possible signals lhat satisfy this requirement are: 
A signal with continuously varying phase (e.g., two 
tones, offset in frequency) 

A signal with eight discrete phases (e.g.. an 8PSK mod- 
ulated signal and its coherent carrier). 
Inverting equation 2, 



I 

Q 



= P 



x - I,) - e x 
y - Qo - e y 



and thus 



U.Q)'(l.Q) = (5) 

p' 2 [x -Io-tx-y-Qo- S)(S" 1 )'(S ')(x - l„ - e x . y - Q,, - e v ), 

because (R'R) = (R ')'(R ') is the identity matrix (R is 
orthogonal). This can be simplified by noting that 



(S- , )'(S- , )= (7COS*) 



= i-Ycos^r 



cos<6 -sind) 



0 



fcos</> 0 1 
-sin* y J 



1 - 'ysintfr 1 
-ysinrjS y 2 



Let us assume that the errors in measuring x and y have 
zero means, are uncorrected with one another, and have 
common standard deviation it. Then, on average. 

avelej = ave(e y ) = 0 
ave(e x ) = ave(e y ) = cr 



and so equation 5 becomes: 

(x-I,,)- - 2-y(sin<*)(x-I ( ,)(y-Q, l ) + -y 2 (y-Qo) 2 
(7-+ lyr-y^p-cos'-tl) = 0. 



[6] 



If measurement error is negligible compared to the mag- 
nitude of the adjustments, the fourth term in equation 6 
can be ignored. 

Equation 6 is nonlinear in the parameters, of which there 
are now only five (clearly the rotation, or lock error, cannot 
be estimated without relative phase information). Al- 
gorithms for nonlinear least squares often rely on gradient 
methods or on linearization of the problem via a Taylor 
series expansion. In addition, several methods for approx- 
imate confidence intervals for the parameters have been 
proposed. These two issues are described in the Appendix 
on page 24. 



25 -r 



20 



15 -- 



10 -- 



5 -- 




-0.002 0 0.002 0.004 0.006 0.008 

l a ' Residuals in x-Direclion (Transformed x -Ideal I) 



12 -r 



10 
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(b) Residuals in y-Oirection (Transformed y- Ideal Q) 

Fig. 4. Histograms of the x and y residuals (e,. e v ) tor the 
first example The histograms show the differences between 
the transformed measurements in each channel and their 
ideal values as shown graphically in Fig. 2 
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Examples 

This first example serves as a "control." Six (x.y) mea- 
surements were taken in each of eight states (Fig. 3). All 
calibration factors are essentially at their nominal values. 

[6oAl&l= (0.000054.0.16564. -0.068486) 
[0 O ,0 I .0 ? )= (- 0.002694. 0.068058. 0. 163904) 
i 0 = 0.00005 volts (I-channel offset) 
Qo= -0.00269 volts (Q-channel offset) 
p= 0.17747 volts (Compression) 
0= tan"' (0.39206) = 21.41° (Lock error) 
y= 1.0100 (|/Qgain imbalance) 
i= tan '(0.00152) = 0.0871° (Quadrature 
error), 

Root mean squares: s x = 0.0012162: s y = 0.0008686 volts. 
Approximate 95% confidence intervals (estimate ±2SE): 

Lj:( - 0.00044. 0.00055) volts 
Qo:( - 0.00295. - 0.00243) volts 

7:(1.0053. 1.0147) 

p:(0.17711, 0.17783) volts 
tand>:( - 0.00144. 0.00448) — d> : ( - 0.08°. 0.26°) 
tan(?:(0.39153. 0.39253)— d: (21.38°, 21.43°). 

Fig. 4 shows the histograms of the x and y residuals. 
Clearly there is one aberrant measurement. This measure- 
ment occurred in state 4 (I = — 1, Q = 0). A comparison 
of these estimates with those obtained with a more robust 
procedure (downweighting this point) verified that the 
point is not so extreme that it affected the quality of the 
overall fit. 

Using the algorithm for unknown phase (see above), 

I o :0.00019 ( - 0.00044, 0.00055) volts 
Q„: - 0.00243 ( - 0.00295, - 0.00243) volts 
7:1.0100(1.0053,1.0147) 
p:0.1 7748 (0.1 7711. 0.1 7783) volts 
tan(rf>):0.00230[- 0.00318. 0.00622) -» tf>:0.13° 
(-0.18°. 0.36°). 



Notice that these solutions fall well within the confidence 
intervals of the linear (known-phase) method. 

The second example involved a large quadrature adjust- 
ment. Eight (x.y) measurements were taken in each of eight 
states (Fig. 5). 

(d 0 .a,,d z )= (0.089621.0.143384. -0.292603) 
(&,./?,.#,)= (0.011144.0.227279.0.244292) 
\u = 0.089621 Volts (1-channel offset) 
Q,, = 0.01 1 144 Volts (Q-channel offset) 
/5= 0.33367 Volts (Compression) 
6= tan"' (1.11516) = 48.12° (Lock error) 
y = 0.97656 (1/Q gain ratio) 
<j= tan" 1 (-0.36852) = - 20.09° (Quadrature 
error). 

Root mean squares: s x = 0.0019728: s y = 0.0015534 volts. 
Approximate 95% confidence intervals (estimate 2:2SE): 



I 0 : (0.08913. 0.09011 1 volts 
Qo:(0.01076. 0.01153) volts 
7:(0.97586, 0.97726) 
p:(0.33312. 0.33422) volts 
tand>:(- 0.36263. -0.36891)- 



[-20.25°, -19.94°) 



lanfl:(1.10963. 1.12069)— : (47.97°. 48.26°) 




-0.004 - 0.002 0.0 0 002 0.004 0.006 

(a) Residuals in x-Direction (Transformed x- Ideal I) 
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Fig. 5. Data lor the second example. Eight measurements 
were taken in each ol eight states. 
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Fig. 6. Histograms ol the x and y residuals fe„ e y ) lor the 
second example. 
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Appendix 

Solving Nonlinear Least Squares Problems 

The accompanying paper discusses the solution to estimating the parameters in a 
mode! that Is linear in those parameters, e.g.. 

X, = t»o + a,l, * a 2 Q, + 1 = 1, ....n. 

The estimates are given expitcitfy in equation A of the paper. 

When the model is nonlinear in the parameters, such as in oquatton 6 ot the paper, 
one approach is !o approximate the function by a first-order Taylor series expansion 
and solve as tor the linear problem For a function, say t. of the data values (x,.y,) of 
only one parameter, say ft 

0 = - (| 

- '(x,.y ( ;e) - (e-0 o >dt(x,. y,;tf)/dot flfi + *, 

trom which it follows that 

B = «o - f(x„y ( ;^)/f(i( )t y l :flj <1) | 

Let tf 0 be a prior guess for the parameter ft Since there are n values of '(x.,y ( ;*U and 
f'(x,,y l ;0 o ), we can calculate a new n by replacing the quantities in equation i by their 
averages 

Equation 6 ot the paper actually has five parameters: 0 |-r,p,l 0 ,Q 0 ,«W. The multi- 
variate extension of Taylor's theorem becomes 

where [^f(x,,y,;^,|«*<J is a matrix of partial derivatives of the function f with respect to 
each of the five parameters Lei 'o denote the column vector of n values of the 
function f(x,.y,;flb), i = 1, ... n and let W be the n - 5 matnx ol values of the function 

<rf(x,,y t ;<U/afy. 1*1 n. k - 1 5, ail evaluated at the current guess Hq Then, ideally, 

f(x„y.;ff) - 0, so 

0 l D h W(» - » 0 ) 

or 

-f 0 - W(« - fl 0 ). {2) 

This problem is in the same format as that for the linear least squares p-'obiem, where 
W was the n*3 matrix M of rows consisting of 1s. I„ and Q ( , and 0 was the vector of 
parameters denoted by a. 

Like equation 4 of the accompanying paper, the solution to equation 2 is given by 

i» - 0o) ~ -(WW) 'W'f 0 

and Ihus 

0™. «o -(WW) 'W( 0 (3) 

Equation 3 is iterated until satisfactory convergence is obtained. (For the examples 
in the accompanying paper, three to tour iterations assured convergence to the tilth 
significant figure.) This method ot finding the ieast squares solution to a nonlinear 
problem is known as a Gauss-Seidel iteration and has nearly quadratic convergence 
when the data is known to fit the model well The algorithm and other methods are 
described in reference 6 listed on page 25. 

The coverage ol approximate confidence intervals for parameters solved by nonlinear 
least squares algorithms such as this one has been investigated (see reference 5 listed 
on page 25) The simplest and easiest approximation to the standard error ot is 
given by 

i"r|kth diagonal element ot (WW) f* 

where B is the standard deviation ot the residuals 

However, an interval of 1 2 of these "standard errors around Ihe estimated param- 
eters may have coverage as low as 75% instead of the nominal 95% Hence, lor 
conservative purposes, we recommend using the multiplier 3 lor an approximate 95% 
confidence interval Further details can be tound in references 5 and 6 listed on page 25. 



Fig. 6 shows the histograms of the x and y residuals. 
Using the algorithm for unknown phase (see above), 

l„:0.08871 volts 
Qo.-0.Oi1 87 volts 
-y:0.97723 
p:0.33978 volts 
tan {<b): - 0.36460 -* : - 20.03°( - 0.18°. 0.36°). 

Notice thai a large quadrature adjuslment ( - 19.25°) and a 
slight adjustment in the gain imbalance are indicated. The 
other errors are considered negligible. 

A Caution 

If Ihe model departures (residuals) are not Gaussian dis- 
tributed for any reason (e.g., outlying points as in the first 
example, or misidenlified slates), or if there are unequal 
numbers of measurements in each stale in the known-phase 
algorithm, the estimates of the target parameters and their 
standard errors may be seriously biased. Least squares fit- 
ting procedures have a tendency to make all residuals about 
Ihe same magnitude, even if, in fact, the residuals in all 
measurements were tiny, except for one. One indication 
of this case is if the median of the residuals is not zero, 
even though their average is. But this criterion would be 
ineffective for detecting (wo residuals in opposite direc- 
tions. To reduce Ihe effect of such aberrant measurements, 
Ihe algorithm is made more robust by assigning a weight 
to each measurement based on the magnitude of its depar- 
ture from the model. This feature is essential to avoid seri- 
ous bias in the calibration factors. 8 ' 7 

Summary 

A vector demodulator can be calibrated using a statisti- 
cally-based algorithm to determine the size and direction 
of the adjustments necessary to guarantee accuracy of the 
output. The uncertainties in these adjustments can be de- 
termined based on some assumptions aboul the measure- 
ment process: 

The ideal (I.OJ measurements of the signal in the two 
channels have been transformed to (x,y) measurements 
according to the following series of operations (listed in 
order of occurrence): rotation (lock error), sheer (quadra- 
ture, gain imbalance), compression, dc offsets. 
Errors in the measurements in the Iwo channels (x,y) are 
uncorrelated. 

When the relative phases among Ihe points are known, 
data is collected in phase-stepping mode for eight stales, 
and the stale (0 7) can be associated with each mea- 
surement. It is convenient if the number of measure- 
ments is the same in each of the eight states. 
When phase information is known only imprecisely, 
measurements are made that ideally lie on a circle in 
Ihe 1-Q plane. Input signals must have constant 
amplitude and varying phase (e.g., continuously, as in 
a two-lone setup, or discretely, as for 8PSK). 
Based on these assumptions, a straightforward linear 
leasl squares algorithm can be used to determine the sizes 
and directions of the adjustments when relative phases of 
the incoming signal (such as 8PSK from Ihe HP 8780A) are 
known. If this is not the case, then a linearization of the 
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nonlinear least squares problem can be solved, with some 
loss in precision of the adjustments and increased compu- 
tational time. Since this approach can be used more gener- 
ally to calibrate external demodulators as well, it is the 
algorithm currently programmed in the HP 8981A. In both 
cases, approximate 95% confidence intervals for the adjust- 
ments can be derived. 

Examples with data collected on the HP 8981A using 
the HP 8780A as a reference signal illustrate the implemen- 
tation of the algorithms. The examples illustrate the impor- 
tance of incorporating an efficient and robust calibration 
algorithm using statistical principles to control the 
hardware. Proper modeling of the system combined with 
the appropriate methodology results in accurate calibration 
factors as well as an assessment of the uncertainty in these 
factors. 
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An Availability and Reliability Calculation 
Tool for Computer Systems 

This software program helps R&D engineers make trade- 
offs in designing for reliability. It also provides marketing 
representatives with a tool to evaluate the reliability and 
availability of customer-defined systems. 

by Wulf D. Rehder 



AS OFFICE NETWORKS, computer clusters, multi- 
processors, and other ensembles of hardware de- 
vices are integrated into large systems, the tradi- 
tional problems of reliability, availability, and serviceabil- 
ity/supportability (the RAS features) gain a new and in- 
creased importance. The advent of complex systems that 
provide a variety of solutions rather than just computations 
has shifted the computer dependability issue from compo- 
nent reliability requirements to the customer-oriented con- 
cept of system availability. 

A new algorithm for modeling the RAS features of large 
systems has been implemented in a prototype software tool 
for internal use within Hewlett-Packard. Before illustrating 
the benefits of the RAS tool, it is important to clarify the 
distinction between device reliability and system availabil- 
ity and serviceability. 

Fig. 1 shows how a system might degrade through several 
stages during which useful work can still be done. For 
instance, a printer or workstation may be down temporari- 
ly, or may not be functioning optimally. In a situation like 
this the system is to some degree imperfect, but still usable. 
In most cases, a system with such operational faults is still 
available for proper, albeit reduced, service. 



Designing for this possibility of controlled degradation 
and high availability in the presence of faults requires the 
inclusion of means to handle fault diagnosis, system recon- 
figuration, and operational recovery. Preferably, this is all 
done intrinsically by a local control or support processor 
or remotely from a central support facility. 

Not every fault results in a failure, even though the sys- 
tem as a whole may contain a transient or permanent error. 
There exist many examples of such fault-tolerant system 
behavior. A well-known example of a transient error is an 
alpha-particle strike of a memory cell causing a one-hit 
error. If the system is capable of one-bit error correction, 
then system operation is not affected at all. As another 
example, deconfiguring certain defective parts, from mem- 
ory and processor boards to peripherals, does result in a 
degraded system, but not necessarily in system failure. 
Similarly, a defect (bug) in a software program need not 
corrupt the system operation. 

On the other hand, if the effect of an error escalates 
sufficiently to prevent the continuation of acceptable ser- 
vice for the user, then the system state falls below the 
threshold of availability, and a permanent failure occurs 
that requires external interference and repair. 




26 HEWLETT-PACKARD JOURNAL JUNE 1988 



© Copr. 1949-1998 Hewlett-Packard Co. 



RAS Calculations and Sensitivity Analysis 

For a specific system configuration, the RAS tool com- 
putes the availability A. mean time between failures MTBF. 
annualized failure rate AFR. mean time to repair M l IK. 
and expected total up and down time per year, given the 
failure and repair rates of the individual devices. Further- 
more, the unit costs are added and the ratio AFRVKSVAS 
is computed, where K$V AS = thousands of dollars of value 
added shipments. 

To make data entry easy and the output format clear and 
comprehensive, segmented windows on the screen prompt 
the user for input values and the softkeys are activated to 
browse back through the data and edit it if needed. Results 
of the calculation can be inspected on the screen or can be 
printed in the form shown in Fig. 2. Here an abbreviation 
such as 2(1 ) means that out of n = 2 devices present only 
k = 1 is needed for the system to be operational. 

Besides providing the kind of snapshot of the major fig- 
ures of system merit shown in Fig. 2, the RAS tool's flexible 
capabilities can be employed for the purpose of a "What 
if..." sensitivity analysis. It will answer questions about 
the impact of additional redundancy not only on system 
availability, but also on its complexity, its cost, and the 
ratio AFR'KSVAS. Likewise, some simple trade-off deci- 
sions about where to employ redundancy and how to in- 
crease service can be made. For instance, will a 2-out-of-3 
(triple modular) redundant power system design more ef- 
fectively increase RAS than, say, selective dual porting? 



There are many complex technical issues and questions 
of resources, market requirements, and competitive pres- 
sures that determine the outcome of many trade-offs, prob- 
ably even more so than the purely numerical suggestions 
the RAS tool is able to give. But for an initial assessment 
of the relative impact of different configurations on the 
RAS features, the tool is valuable, fast, and easy to use. 

Redundancy Modeling 

The basic model assumption requires that the system 
configuration be broken up into a series of device classes 
made up of devices or units or components, which may be 
field-replaceable units (FRUs) or more complex devices 
like peripherals. It is in these device classes or modules 
that the RAS tool allows any kind of k-out-of-n:G redun- 
dancy. This means that at least k out of the n units present 
have to be working (to be G = good) for the specific device 
class to be considered up, or operational. For instance, a 
3-out-of-3:G printer constellation means all 3 printers are 
necessary. l-out-of-2:G processors is an assembly of two 
parallel (masking) processors, and a 92-oul-of-118:G setup 
of workstations means that the device class of 1 18 worksta- 
tions is considered available and acceptable to the user 
even if as many as 26 of them have failed or are off-line. 

Another useful property that we believe is new in the 
RAS tool is the possibility of including device classes com- 
posed of several different but functionally equivalent units. 
The device class of 112 workstations may, for instance, be 



ACME Computer System 
(All numbers are fictitious) 



Contribution to System Failure Rate for Each Device Class 



ACME Computer System 
(All numbers are fictitious) 



AFR (%'yeaf) 


1558.67 


MTBF (hours) 


562.02 


MTTR (hours) 


4.74 


Cost(KS) 


92.70 


AFR/KS 


16.81 


Availability (%) 


99.16 


UPTIME (hours) 


8686.79 


DOWNTIME (hours) 


73.21 



• Assumes 1 year = 8760 hours of operation 



SPU 74.98 

SPU 1(1)" 

Memory 72.00 

4M-Byte & Controller 3(3) 

5M-Byte Memory Board 4(4) 

I/O 117.99 

Channel Adapter 4(4) 

IB 5(4) 

LAN 3(3) 

Disc Drives 299.74 

Drive Type I 5(5) 

Terminal I/O 0.00 

DTC 6(3) 

Workstations 0.09 

Terminal 70(68) 

Printers 893.76 

Printer Type I 1(1) 

Tape Drives 100.12 

Tape Drives I 3(2) 

Tape Drives II 1(1) 



" Total number of devices (Minimum that must be functioning) 



Fig. 2. Results of RAS tool calculations 
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The Language of Dependability 



The reliability of a syslem as a function of lime. R(t), is the 
probability lhal the system will survive throughout the entire inter- 
val [O.t]. Reliability as a dependability measure is most useful 
for systems in which repair cannot take place (space missions), 
or in which the whole system's operation is so critical that even 
short-time repair or temporary degradations are impossible, or 
when repair is too expensive. 

The availability of a system as a function of time, A(i). is ihe 
probability that the system is operational at time t. If the steady- 
state availability exists, that is, if the system availability stabilizes 
over lime at a constant value A. then A expresses the average 
time during which the system is capable of performing useful 
work Such useful work Is done in a machine or system state that 
is acceptable to and defined by the user or customer Availability 
is useful as a figure of merit primarily in systems in which service 
can be delayed or denied tor short periods while the system 
functions in a degraded mode 

Fault-tolerant computing means the continued correct execu- 
tion of a specific algorithm in the presence of operational faults. 
For this to work, the usually detrimental effects of defects must 
be overcome by employing hardware and/or software tech- 
niques Typical hardware methods include redundancy, but usu- 
ally software and hardware have to work together for maximal 
effectiveness. As a consequence, fault-tolerant computing be- 
comes a topic of design and architecture rather than being an 
issue of hardware components. 

Trade-Offs 

While high reliability is a no-compromise business, high avail- 
ability and fault-tolerant computing require delicate trade-offs. A 
trade-off needs to be made whenever a response variable V (for 
instance, customer satisfaction) depends on at least two conflict- 
ing variables, one of them pulling V up. the other pulling V down 
(for instance, the benefits versus the costs of high availability) 
Trade-offs are unavoidable design decisions for complex sys- 
tems, especially at the high end. Ideally, trade-offs would op- 
timize every important technical and market-driven response vari- 
able under the constraints and boundary conditions of resources, 
expected return on investment, technology, and competitive 
pressure. 



composed of 80 HP 150 Computers and 32 HP Vectras, or 
any other combination. While the 150 and the Veclra have 
different failure rates, they are, as workstations, function- 
ally equivalent (even if. for example, one is faster than the 
other). 

Markov Chain Algorithm 

McGrady' describes an algorithm to calculate the avail- 
ability of a k-out-of-n:G subsystem with equal or different 
components. Instead of reproducing this procedure in its 
generality here, we give a simple example of a parallel 
subsystem, that is. a l-out-of-2:G device class. Say that this 
is the jth subsystem (of a total of m subsystems in the 
system) and has component availabilities An, i = 1,2, 
which are computed from the respective failure rates X,, 
and repair rates rfijji 



An - <V( ^ii + x nl- 

The repair rale 4>y t is related to the corresponding mean 
repair time MTTRh by 



<t>„ = 8760/MTTR,,. 



if a full duty cycle of 87B0 operating hours per year is 
assumed. The subsystem availability is 

A| = 1 - (1 - AjjJti - A i2 ). 

McGrady's routine to compute such device class avail- 
abilities is based on the idea that Aj can be written as 

Aj = %%ta + (1 - AjtJifAj, + (1 - A B ]/A, 2 |. |1) 

Generalizing to more complex k-out-of-n:G subsystems 
leads to similar expressions, For instance, the correspond- 
ing expression for a 2-out-of-3:G device class (sometimes 
encountered in power systems) is: 

A^A^A^I-Hl -AiJ/A.-. + ll-A^l/A^ + ll -A |3 )/A, 3 ]. 

(2| 

See reference 1 for more examples. 
Finally, the product of all the subsystem availabilities 
gives the total system availability A_: 



A 5 = 11 A, 

f—i 



The question is how to compute the system failure rate 
and system MTBF from the knowledge of the subsystem 
availabilities A, and Ihe syslem availability A s . As it turns 
out, knowing the subsystem availabilities and their math- 
ematical forms (e.g.. equations 1 and 2 above) is sufficient. 
The method comes from the theory of Markov chains. 

Markov chains are capable of modeling the different pos- 
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Fig. 3. ft AS tool block diagram. The tool models degradations 
and redundancies at the CPU and peripheral levels, and 
computes reliability measures tor general customer-defined 
configurations. 
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sible states (e.g., of degradation) of a system. In particular. 
Markov chain tools allow the calculation of steady-state 
(matured) failure rates A, for all subsystems. The steady- 
state failure rate X, for the jth subsystem can be expressed 
as a weighted sum involving the device availabilities Au 
and the repair and failure rates of the devices in the subsys- 
tem. The details are given in Ross. 2 For example, the failure 
rate of a parallel subsystem turns out to be: 

X, = X,,A n (l - A 12 ) + Wd - A„). 

where An and A r , can be expressed in terms of the d>y, and 
\ji of the devices in the subsystem. 

The failure rate X, of a system that is a series configuration 
of subsystems is the sum of the subsystem values X,. 

Finally, the system MTBF, MTBF S , is given by the recip- 
rocal of X,: 

MTBF S = 1/ X s . 

which is very familiar to engineers. 
The average repair time MTTR S for t he system is equal to 

MTTR S = MTBF S (1 - A s )/A s . 

Fig. 3 is a block diagram showing the major building 
blocks of the RAS tool. 

Lotus Compatibility 

Currently the RAS tool is installed in an HP 3000 Series 
58 Computer. Future versions are being prepared to run 
on faster machines. A simplified algorithm for small to 
medium systems has been implemented in a Lotus" 1-2-3* 
spreadsheet format to make its use more interactive. 

As an example. Fig. 4 shows how the device class failure 
rates change when the configuration from the example in 
Fig. 2 is altered as follows: 

Lotus and 1 2-3 are registered trademarks ol Lotus Development Corporation 



2 parallel SPUs instead of 1 SPU 

2 out of 3 4M-byte memory boards instead of 3 boards 

3 out of 4 tape drives I instead of 2 out of 3 tape drives I 

The Lotus version of the RAS tool can handle the relia- 
bility impact of such configuration changes as well as 
changes in component failure rates, repair times, or prices 
in a few minutes, giving the designer a quick evaluation 
of design trade-offs and providing the sales representative 
in the field with a hands-on tool to discuss RAS options 
with a customer. 

A more ambitious enhancement to include stress factors 
(e.g., temperature) is in its investigation phase. 
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Project Management Using Software 
Reliability Growth Models 



At HP's Lake Stevens Instrument Division, the Goel- 
Okumoto software reliability growth model has provided a 
means for more accurately predicting the duration of system 
testing and determining when to release a new product. 

by Gregory A. Kruger 



SIGNIFICANT IMPROVEMENT in software reliabil- 
ity calls for innovative methods for developing soft- 
ware* determining its readiness for release, and 
predicting field performance. This paper focuses on three 
supporting strategies for improving software quality. First, 
there is a need for a metric or a set of metrics to help make the 
decision of when to release the product for customer ship- 
ments. Second, accurately estimating the duration of sys- 
tem testing, while not directly contributing to reliability, 
makes for a smoother introduction of the product to the 
marketplace. 

Finally, achieving significant improvement is easier 
given the ability to predict field failure rates, or perhaps 
more realistically, to compare successive software products 
upon release. Although this third strategy will be discussed 
in this paper, the emphasis will be on choosing the right 
software reliability metric and confidently managing the 
testing effort with the aid of software reliability growth 
models. For a more thorough discussion of estimating soft- 
ware field failure rates, see Drake and Wolting. 1 

Defects per Hour as the Quality Metric 

Two principal reliability measures are: 
Defect density expressed as defects per thousand lines 
of noncomment source statements (KNCSS) 
Number of new defects found per hour of system testing. 
The first measure will show progress on successive soft- 
ware projects, while the second measure is more helpful 
for real-time project management. 

The user is unlikely to perceive a software product in 
terms of lines of code, since this is an invisible dimension 
of the product. The customer perceives quality in terms of 
how often a problem is encountered. This is analogous to 
a hardware product. We speak of the mean time between 
failures (MTBF) for repairable instruments, and in fact, 
failure rate and its reciprocal. MTBF. are appropriate mea- 
sures of software quality. It is likely that, however different 
the sizes in lines of code, if two software products have 
the same failure rate during operation, they will be per- 
ceived as having the same level of reliability by the customer. 

Although ascertaining the customer-observed failure rate 
is difficult at best, measuring the failure rate during system 
testing is straightforward. For each successive time period, 
a week for example, record both the engineering hours 
invested in testing and the number of new defects found. 



Dividing the number of defects by the hours spent testing 
gives an estimate of the "instantaneous" new-defect-fiiul- 
ing rate. As defects are found and removed from the soft- 
ware, this rate declines. The decreasing new-defect-finding 
rate and the corresponding increasing mean time between 
finding new defects provide the criteria by which progress 
and the ultimate conclusion of system testing can be 
judged. 

Reliability Growth Modeling 

There is considerable statistical literature on modeling 
the reliability growth process of finding and fixing defects 
in a software product. Available models can be classified 
into four broad categories: 2 

Input-domain-based models 

Fault-seeding models 

Time-between-failures models 

Fault-count models. 

Input-domain-based models can be characterized as a 
basic sampling theory approach. One defines the input 
domain of a software system to be the set of all possible 
input data the software will encounter during operational 
use. The reliability can then be estimated by taking a rep- 
resentative sample from the input domain and looking at 
the resultant failure rate when the sample data is input to 
the system for execution. Although the input domain sam- 
pling procedure will not be strictly followed here, the con- 
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Fig. 1. Typical plot of m(t) and (ft) lor the Goel-Okurnoto 
model 
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cepts embodied in this technique turn out to be helpful in 
designing the testing process and understanding the limi- 
tations of making estimates of field performance based on 
defect-finding rates observed during system testing. 

The approach in fault-seeding models is to seed a known 
number of defects into a software product. The software is 
then tested using a process that presumably has equal prob- 
ability of finding a seeded or an indigenous defect. The 
numbers of indigenous and seeded faults found are used 
to estimate the system reliability. This concept has several 
implementation problems and draws considerable skepti- 
cism from project managers and designers alike. 

The preferred data in time-between-failures models is 
the observed running times between successive failures. 
Typically it is assumed that the time between two consecu- 
tive failures follows a distribution whose parameters are 
dependent upon the number of defects remaining in the 
software during that time interval. These distribution pa- 
rameters can be estimated from the observed time-between- 
failures data. However, the actual running time between 
successive failures may be a difficult measure lo extracl 
from the software testing process. 

In fault-count models, the variable of interest is the 
number of defects observed per specified time interval. 
The basic idea is that the number of failures observed per 
time interval can be modeled according to a Poisson pro- 
cess. The Poisson distribution is widely used to model the 
number of occurrences of some event in a time or space 
interval. As previously noted, the defect-finding rate statis- 
tic is relatively easy to capture. For this reason, the focus 
of this paper will be on the category of fault-counl models. 

John Musa's execution-time model, discussed by Drake 
and Wolting.' is an example of a fault-count model. Al- 
though their derivation differs from Musa's, Goel and 
Okumoto 3 propose essentially the same model. Because of 
its simplicity and inttiitiveness and this author's preference 
for the derivation of the model from the underlying assump- 
tion of a Poisson process, the Goel-Okumoto model will 
be used here. 

The Goel-Okomoto Model 

The model assumes that between software changes, the 
number of defects observed during each hour of test will 
follow the Poisson distribution with a constant average. A. 
Note that this defect rate includes the occurrence of both 
new and repeat defects. If the observed defects are fixed 
immediately, the Poisson parameter A will be decreasing 
every hour of testing (or at least decreasing with every code 
correction). In this case, we have a nonhomogeneous Pois- 
son process and can expect the defect-finding rate to de- 
crease exponentially until software changes cease. Con- 
versely, as the defect rate declines, the cumulative number 
of defects found will increase and asymptotically approach 
a constant. Since there is a lag in the correction of defects, 
the ideal state of instantaneously fixing defects when found 
can be approximated by counting only the new defects 
found each hour. 

The functional form of the model is as follows. The 
cumulative number of defects found by time I is given by 

m(t) = a(l - e' 1 ") 



and the instantaneous new-defect-finding rate at time t is 
given by 

l(t) = m'(t) = a/So-*'. 

In this model, a is the expected total number of defects 
in the software, fi is the initial defect-finding rate divided 
by the expected total number of defects in the software, t 
is the cumulative hours of system testing, and m'(t) denotes 
the derivative of m(t). 

Figure 1 shows the typical shape of l(t) and m(t). 

It should be noted that the parameter u is actually an 
estimate of the total number of defects the system testing 
process is capable of detecting. The input domain concept 
is useful for understanding this point. If there is some 
stratum of the input domain that the system testing process 
is not covering, the parameter a is actually an estimate of 
the total number of defects minus those generated by data 
from that specific stratum. 

There are three key assumptions underlying this model: 

0 All new coding is completed before the start of system 
testing. 

1 Defects are removed with certainty and without intro- 
ducing new defects. 

■ Testing methods and effort are homogeneous, that is. the 
last hour of testing is as intense as the first. 
The first assumption ensures that the defect-finding rate 
follows a monotonic pattern. While the second assumption 
is difficult to guarantee, instituting strict security measures 
will minimize the rate of defect introduction. The phasing 
of distinct test activities, the introduction of new testing 
tools, the addition of new team members, and software 
testing burnout make meeting the third assumption dif- 
ficult, as well. However, good planning of the testing pro- 
cess and averaging the defect-finding rale over a calendar 
week will tend to smooth out differences in testing effort. 

Fitting the Model on a Project 

Consider the application of the Goel-Okumoto model lo 
a 90-KNCSS firmware producl. This product had com- 
pleted system testing eighteen months earlier, but records 
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were available from which a data set could be constructed 
to test the feasibility of the model. For each of 43 weeks 
of system testing, the cumulative test hours, the instantane- 
ous new-defect-finding rate, and the cumulative number 
of defects found were calculated. However, because the 
number of hours invested in system test during the first 
several weeks of testing was highly variable, the first 19 
weeks of data was condensed into five data points, each 
representing approximately 200 hours of testing. This left 
29 sets of data values, each representing at least 200 con- 
secutive hours of testing. The nonlinear least squares fit of 
the the l(t) model to this data appears in Fig. 2. The obvious 
departure of the actual defect-finding rate data from the 
model at about 9000 cumulative hours appears to be 
explained by the addition of several experienced designers 
to the testing team. This is a clear violation of the assump- 
tion that the testing effort remains homogeneous. Using 
the parameter estimates obtained from fitting l(t). the m(t) 
model and cumulative data are plotted in Fig. 3. Parameter 
estimates and standard errors from fitting l(t) are: 

Kstimates:d = 2307 Standard Errors: a a = 362 

/l = 0.000384 Up = 0.000102 

There are some statistical advantages to fitting l(t) to the 
defect-finding rate data and using the resulting parameter 
estimates to plot m(t). By the nature of cumulative data, 
the residuals from fitting m(t) are serially correlated. How- 
ever, the residuals from fitting l(t) to the instantaneous 
defect-finding rates are well-behaved, more closely con- 
forming to the standard assumption of normally distributed 
errors with mean zero. On the other hand, l(t) can only be 
fit if there are enough hours of testing and defects found 
to generate reasonable estimates of the instantaneous de- 
fect-finding rate. If a project will be undergoing a small 
number of test hours so that blocks of time (such as a week) 
cannot be aggregated to give an estimate of the defect-find- 
ing rate, then it is difficult to obtain parameter estimates 
by the method of least squares. In this case, maximum 
likelihood estimation should be used to fit the m(t) function 
to the cumulative data. This should not be interpreted to 
mean that maximum likelihood estimation is an inferior 
technique to be employed when the least squares method 
is not feasible. On the contrary, it can be demonstrated that 
maximum likelihood estimates are among the best. The 
intent here is to show what is possible with simple least 
squares procedures. 

Using the Model in Real Time 

The project discussed above, project A, demonstrates 
that real software testing data does follow the nonhomoge- 
neous Poisson process model. Another project, project B. 
an application software product consisting of 100 KNCSS, 
demonstrates the potential for using the defect-finding rate 
metric and the Goel-Okumoto model in managing the sys- 
tem testing process. 

Setting the Release Criteria. First, a defect-finding rate ob- 
jective must be incorporated into the existing set of software 
release requirements. The total release checklist would 
then include such things as all functionality complete, all 
performance specifications met, no known serious defects, 



and the defect-finding rate criterion. The question remain- 
ing is, "What would be an appropriate value for this defect 
rate?" Project A concluded system testing with a rate of 
0.03 defects per hour. As it turned out, the firmware proved 
to have extremely high reliability in the field. However, it 
is not clear that a 0.03 defect-finding rate is appropriate 
for all projects — particularly on software rather than 
firmware projects. In addition, arguments can be made that 
the relative severity of each defect found should be used 
to express the release criteria in terms of a weighted defect- 
finding rate. Ultimately, based upon work at other HP di- 
visions and the belief that the defect weighting would be 
quite severe, a release requirement of 0.04 weighted defects 
per test hour was established. A team of four project en- 
gineers reviewed all defects found and scored each on a 
1-9 scale with 1 being a cosmetic flaw and 9 being most 
serious. These scores were then normalized by dividing 
by nine. The team recognized that the software reliability 
modeling process would be dependent upon their ability 
to weight the defects consistently throughout the duration 
of the project. 

Applying the Model. Once the release requirement is set, 
the Goel-Okumoto model becomes useful as a real-time 
management tool. Having fit the model to the most recent 
data, an estimate for the total test hours required to release 
is obtained by calculating where the curve crosses the re- 
lease requirement. The management learn can then review 
weekly plots of the defect-finding rate and estimates of the 
total test hours to release. These plots enable the team to 
predict the project's conclusion more accurately and to 
judge progress along the way. In addition, the engineering 
staff conducting system testing may find the data to be 
morale building. Without some measure of progress, system 
testing is often perceived to be a never-ending process. 

Project B ultimately reached the release goal after 5760 
hours of testing over 22 calendar weeks. Figs. 4 and 5 show 
the final plots of the l(t) and m(t) functions fit to Project B 
system testing data using nonlinear least squares. The pa- 
rameter estimates and standard errors from fitting l(t) are 
as follows: 



Estimates: d = 1239 



Standard Errors: a,. = 116 



j§ = 0.000484 



On = 0.000069 
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One can simplify the analysis by first taking the natural 
log of the defect-finding rate data. A quick check of the l(t) 
function verifies that taking the log transformation makes 
the model linear. Fig. 6 shows the regression line fit to the 
logged data. The benefit of operating with a linear model 
is that standard techniques can be used to set confidence 
bounds on the estimates of total system test hours needed 
to reach the release requirement.'' As can be seen from Fig. 
7. the estimates of total test hours proved to be both consis- 
tent and accurate. The project concluded system testing 
within 300 hours of the estimate made eleven weeks previ- 
ously. The accuracy of these estimates far exceeded our 
expectations. 

Estimating Field Failures 

Two different but related estimates can be made concern- 
ing the number of defects the customer will observe in the 
first year after release. The first is the expected number of 
times the user will experience a defect in the system includ- 
ing the recurrence of a previously observed defect. The sec- 
ond is the expected number of unique defects encountered . 
New And Repeat Defect Rate. Including repeat defects in 
estimates of a software system's failure rate can be consid- 
ered as taking the perspective of the customer. The cus- 
tomer perceives quality in terms of how often a defect is 
encountered. The user may not be able to tell if two or 
more defects are actually caused by the same fault in the 
code. Estimating the combined new and repeat defect rate 
may be possible using the I(t) model fit during system test- 
ing. Recall that the defect-finding rate declines because 
defects are removed from the software as they are found. 
Once QA ends, defects are no longer being fixed. Therefore, 
defects should now be observed according to a Foisson 
process with a fixed average number of defects per hour 
of use. The best estimate available for this is the defect-find- 
ing rate observed at the conclusion of system testing — 0.04 
weighted defects per hour for Project B, The question is, 
how good an estimate is this for what the customer will 
experience? 

The answer is dependent upon the difference between 
a customer's and an HP engineer's defect-finding effi- 
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Fig. 5. Cumulative defects each week tor project B software 

ciency. It is likely that the test engineer, rather than the 
customer, is the more efficient at finding defects. If an 
acceleration factor. K. could be identified expressing the 
ratio of customer hours to one hour of system testing, the 
customer's defect-finding rate could be estimated as 1/K 
times the defect rate at the conclusion of system testing.* 
The simplifying assumption here is that the customer's 
defect-finding efficiency is constant. Evidence to date 
suggests that a value of K = 50 might be a reasonable esti- 
mate of the acceleration factor. The estimated combined 
new and repeat defect-finding rate would then be 0.0008 
weighted defects per hour. 

New-Defect-Finding Rate. Considering the number of 
unique defects in the software is taking the perspective of 
the company; we want to know if code revisions are likely 
to be necessary. When considering repeat failures, we 
stopped on the exponential failure rate curve and extended 
a fixed horizontal line at the defect-finding rate observed 
at the end of system testing. To estimate unique failures, 
we continue down the 1(1) curve for the first year after 
release to shipment. Conversely, the cumulative number 
of defects found should continue up the m(t) curve. 

It is a trivial matter to estimate the number of unique 
defects the system testing team would find in an additional 

"See relerenco 1 lor another porspoclivo on acceleration lactors 
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t hours of testing. This is simply: 

m(T + t) - (total found so far) 

where T is the total number of test hours accumulated to 
date. In fact, the total number of defects remaining in the 
software can be estimated by 

a — (total found so far). 

For the Project B software. 1239 - 1072 = 167 weighted 
defects. 

An estimate of how many of these remaining defects will 
be found in the field can be accomplished by defining a 
new m(t) function for the customer. The a parameter can 
be set equal to the estimated number of defects remaining. 
The B parameter can be obtained if we are once again will- 
ing to use an acceleration factor. For the Project B software, 
the cumulative number of unique defects found after t 
hours of customer use is given by: 

m(t) = 167(1 - e- 0 " 0 " 484 " 5 "). 

For example, in 2000 hours of customer use we could 
expect 3.2 unique weighted defects to be found. 
Field Failure Rates in Perspective. Estimates of customer 
defect rates and defect totals should be taken with a healthy 
degree of skepticism. It is a risky proposition to propose 
acceleration rates and attempt to estimate customer ob- 
served defect rates. Experience to date clearly shows that 
our customers are not finding as many defects as estimated 
by the model. However, these two projects have not pro- 
vided sufficient insight for us to estimate confidently the 
relationship between model projections and customer ex- 
perience. After more products are released according to 
this process, comparisons will be drawn between the defect 
rate observed in-house and the subsequent rate in the field. 
An empirical estimate of the acceleration factor may be 
obtained in this way. To date, the results from these two 
projects are as follows. 

Recall that on Project A the analysis was conducted on 
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unweighted data — all defects were counted equally regard- 
less of severity. From the model, the number of unweighted 
defects estimated to be remaining in Project A is 2307 - 
2167 = 140. After 18 months in the field, there have been 
three customer reported defects. All of these have been 
resolved to the satisfaction of our customers. Three defects 
found out of an estimated 140 remaining suggests a ratio 
of estimated to actual defects of 47 to 1. 

As has been stated, the estimated number of weighted 
defects remaining on Project B is 167. After 12 months in 
the field, there have been four defects identified, all of 
which have been resolved in a subsequent release. After 
applying the same weighting criteria that was used during 
system testing, these four defects represent 2.4 weighted 
defects. Since this product has not been in the field as long 
as Project A. we could multiply by 1.5 to give 3.6 weighted 
defects after 18 months. The ratio of estimated to actual 
defects would then be 46 to 1 — amazingly close to the 
results from Project A but still conjecture at this point. 

Conclusions 

The defect-finding rate metric and the Goel-Okumolo 
model proved, on one project, to be a real contribution to 
managing the system testing phase of the software develop- 
ment process. Judging from the fit to historical data, the 
model would have been helpful on an earlier project as 
well. Through weekly plots of the data and the model, the 
management team received feedback on testing progress 
and information to aid resource planning. Ultimately, there 
was a clearer decision as to whether to release the product 
for customer shipments. 

In cases with limited test hours, the straightforward ap- 
plication of the Goel-Okumoto model would be to fit the 
m(t) model to the unweighted cumulative defect data using 
the method of maximum likelihood. If there are enough 
test hours to allow reasonable estimates of the instantane- 
ous defect-finding rates, fitting l(t) using least squares tech- 
niques is possible, although not superior to the maximum 
likelihood procedure. Based upon the results from these 
two projects, it appears that the modeling process works 
for both unweighted and weighted defect data. Extrapola- 
tions from the model to field conditions have proved to 
overestimate the number of defects customers will en- 
counter. Further study is required to estimate confidently 
the relationship between model estimates and customer 
observed failure rates. However, we are continuing to 
tighten the failure rate release criterion as new products 
enter final system test. 

An alternative approach to making the release-to-produc- 
tion decision would be to plot the mean time between 
finding new defects. l/l(t). This metric could then be used 
to make an economic decision on when continued testing 
would be too costly. 
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A Reliable, Autoloading, Streaming 
Half-Inch Tape Drive 

Designed for rack mounting, this compact tape drive 
cleverly channels air flow to load different-sized half-inch 
tape reels automatically. It also features higher performance 
and reliability than its predecessor. 

by John W. Dong, Kraig A. Proehl, Ronald L. Abramson, Leslie G. Christie, Jr., and Douglas R. Domel 



HP's NEWEST HALF-INCH TAPE DRIVE, the HP 
7980A (Fig. 1), is an autoloading, reel-to-reel, hori- 
zontally mounted, streaming drive that reads and 
writes in two standard nine-track formats: 6250 GCR and 
1600 PE. 1,2 This maintains compatibility with previous 
drives and tapes, an important feature because half-inch 
tape is still a significant standard in the computer industry 
for backing up, archiving, and interchanging computer 
data. 3 

The HP 7980A provides computer system users with a 
reliable, low-cost, backup device for large amounts of on- 
line disc memory. It does this with higher performance 
and reliability and lower cost than its predecessor, the HP 
7978B, which it replaces. It reads and writes at 125 inches 
per second, roughly 60 percent faster than the HP 7978B. 
The HP 7980A can rewind a 2400-foot tape in less than 90 
seconds, reducing overall data transfer times significantly. 



The HP 7980 A is 40 percent more reliable, a result of the 
increased use of VLSI components to reduce parts counts 
even further than was achieved on the HP 7978B. The 
monthly maintenance cost of the HP 7980A is half that of 
the HP 7978B. Horizontally mounted in a standard-width 
rack cabinet, the HP 7980A is 8.75 inches high, a third the 
size of the HP 7978B. This saves valuable floor space and 
allows better use of rack cabinets. 

The horizontal mounting means that the user normally 
cannot access the tape path to load the tape manually. 
Hence, an autoloading feature was designed into the HP 
7980A. The operator simply places the tape reel in the door 
opening and closes the door. The autoloading sequence 
starts automatically once the door is closed, leaving the 
operator free to do other tasks while waiting for the tape 
to load in a nominal time of half a minute. There is no 
need for an EZ-LOAD cartridge around the tape reel, which 




Fig. I. The HP 7980A Tape Drive 
is a hall-inch, reel-to-reel, 125-ips 
streaming tape drive designed lor 
systems with disc backup require- 
ments of greater than 400 mega- 
bytes, bke its predecessor, the 
HP 7978B, this high-performance 
drive operates with both 6250 
GCR and 1600 PE standard for- 
mats The HP 7980A can automati- 
cally load and thread any size reel 
ranging in diameter from six to 
10.5 inches. This drive can be 
horizontally mounted in a 19-inch- 
wide rack enclosure for better 
floor-space utilization, and it sup- 
ports IBM/ANSI-compatible for- 
mats for software distribution and 
data interchange between HP and 
non-HP systems. 
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is required for autoloading on the HP 7976A Tape Drive. 
The HP 7976A autoloads only 10.5-inch tape reels: the HP 
7980A autoloads all standard half-inch tape reels from six 
to 10.5 inches in diameter. The earlier HP 7978A<B Tape 
Drive must be loaded manually and cannot autoload tapes. 

In addition, the HP 7980A supports the use of 3600-foot 
half-inch tape, under certain guidelines, which the HP 
7978B doesn't. This increases data capacity by 50 percent 
over standard-length tape reels. 

The HP 7980A was developed and released in 40 percent 
less time than the previous tape drive. This was mainly a 
result of two factors. The first was keeping together an 
experienced core group of engineers from the HP 7978A/B 
development team to architect and design the HP 7980A 
drive. The second was concentrating on a core system de- 
velopment of the HP 7980A, that is. having a well-defined 
product and not adding additional features and configura- 
tions along the way. Such added features tend to prolong 
development cycles. This does not mean that these addi- 
tional features are not eventually added, but they are 
worked on according to need after the core or base system 
is released. 

Tape Path 

The HP 7980A is a totally integrated tape drive, simul- 
taneously incorporating a small form factor, an autoload 
feature, design-for-assembly concepts, low cost, and high 
reliability. The major design objective established to ac- 
complish these goals was design simplification. 

The HP 7980A has a very simple tape path as shown in 
Fig. 2. There are only two rolling elements: the speed sensor 
and the buffer arm roller. There is only one additional fixed 
tape guide. The oxide side of the tape contacts only the 
tape cleaner, the magnetic tape head . and the tape displace- 
ment unit. The tape displacement unit, located between 
the tape head and the tape cleaner, contacts the oxide side 
of the tape only during repositioning and while the tape 
is stopped. The tape displacement unit pushes the tape off 
the very smooth surfaces of the head and the tape cleaner 
to prevent the tape's sticking to these smooth surfaces dur- 
ing high temperature and humidity conditions. 

The buffer arm assembly (buffer arm, spring, and roller) 
helps take up slack in the tape during servo starts and 
stops. It also establishes the tension on the tape. The buffer 
arm roller and fixed guide, along with the speed sensor, 
guide the tape in a precise manner over the head. The 
speed sensor measures the velocity of the tape and feeds 
it back to the servo system. 

The half-inch tape reel is centered, seated, and locked 
by the supply hub. The autoload blower forces air through 
the door. The louvers in the door direct the air onto the 
tape reel to lift the end of the tape. The tape end is then 
carried by the air flow around the buffer arm roller and 
the fixed guide. It then goes over the tape cleaner and the 
magnetic tape head. The tape is finally sucked onto the 
take-up reel after passing around the speed sensor. 

The drive motors are located directly beneath and are 
attached to the supply and take-up hubs. The blower is 
located between the two drive motors. 



Integrated Autoload and Tape Path Design 

The HP 7980A tape path is very simple. This greatly 
reduces costs and assists the design of the autoload mech- 
anism, It simplifies the task of blowing the tape end off 
the supply reel and threading it through the tape path and 
onto the take-up reel. 

The casting is an integral part of the tape path and au- 
toload mechanism. The walls of the casting help determine 
the way the tape is blown around by the air flow created 
by tbe blower. The surface roughness of the casting is very 
important. The surface of the casting around the supply 
reel area must provide enough friction to permit the supply 
hub to autocenter the smaller tape reels. Conversely, the 
casting surface near the buffer arm assembly must be 
smooth enough so that the tape doesn't catch on any surface 
features while autoloading. The buffer arm shape is critical 
to autoload and servo success. The design of the air deflec- 
tor on the speed sensor is crucial to how well the tape end 
moves around the speed sensor roller and attaches to the 
take-up reel. 

The autoload success rate is greatly affected by the design 
of the door ramp and the door louvers. These are designed 
so the HP 7980A can autoload all standard half-inch reels 
with varying amounts of tape on each reel. 

Holes in the top cover eliminate the need for gaskets or 
tight tolerances on the door-to-bezel fit and the top cover-to- 
casting fit. They do this by eliminating air flow reversal, 
which can occur when autoload air flows out the door-to- 
bezel and top-cover-to-casting cracks, rather than down the 
tape path. The ribs on the top cover fit inside the casting 
to reduce air leakage and allow loosening of the top-cover- 
to-casting fit. 



Tape 
Displacement 
Unit 




Fig. 2. HP 7980A tape path components 
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The speed sensor is placed in the lape path for maximum 
tape wrap on the roller. This prevents the tape from slipping 
on the roller during tape acceleration and deceleration. 

The buffer arm and its spring are designed and placed 
so that tension variations caused by the tape coming off 
the supply reel at different angles relative to the arm are 
minimized. An additional roller could have been used to 
feed tape at a constant angle to the buffer arm. but was not 
put in for several reasons: cost, simplicity, and ease of 
autoloading the tape. 

Air System Design. The autoload air system was designed 
into the deck casting and the tape path from the beginning. 
Reduction of air ductwork was a major consideration. The 
less the air has to pass through winding passages, the lower 
the air pressure and volume requirements. For this reason, 
the tape path needs to be simple. Reducing the air pressure 
and volume reduces the autoload air blower's size, noise, 
and cost. 

The blower is placed between the two drive motors since 
that provides the shortest air passage. The air is sucked 
into the take-up reel, into the blower, and then blown out 
of the blower into the door area to repeat the path (see Fig. 
3). 

The door is used as an air passage for several reasons. 
No space is taken up by air ducting to blow air into the 
tape path, thus allowing the maximum amount of width 
for the card cage. In addition, using the door as an air duct 
allows the air to blow across the entire front of the lape 
reel for maximum design flexibility. The door louvers were 
then designed to blow air to autoload all standard reel sizes. 

The air velocity required to move the tape down the tape 
path was calculated from momentum and Bernouli equa- 
tions. The pressure drop was calculated in a similar man- 
ner. The blower was then selected based on these calcula- 
tions and the tape path, with the blower, was measured 
for pressure drops and velocities. The measurements were 
fairly close to the theoretically calculated pressure drops 
and air volumes. 

Autoload Algorithm. The HP 7980A autoload algorithms 
are designed to load all sizes of half-inch tape reels in a 
minimum amount of time, while making sure that the user's 
tape is treated with as much care as possible. The autoload 
process is a combined effort of firmware, mechanical de- 
sign, and electronic sensors. Four sensors are used by the 
autoload process to monitor its progress and to indicate 
when an error condition is present. The first of these sen- 
sors is the door sensor, which consists of two micro- 
switches connected in series that detect whether the front 
door or top cover is open. When this sensor detects a door 
closure, the autoload process begins. Opening the top cover 
at any point halts the autoload. Under the supply hub is 
the reel encoder sensor. This optical sensor detects the 
three reel encoder flags attached to the supply hub. These 
flags interrupt the sensor beam when a reel is properly 
seated on the supply hub. The tape-in-path sensor is an 
optical sensor which is positioned across the tape path 
preceding the head. The tape leader will interrupt this 
sensor beam when it enters the tape path. The final sensor 
is the optically encoded speed sensor. This sensor provides 
two quadrature pulse trains which are decoded to give 
position and velocity information. 



Once a door closure is sensed, the autoloading operation 
begins. The first task is to detect whether a reel is already 
threaded through the lape path, a condition that is most 
likely to be present after a power failure. This condition 
is detected by the tape-in-path sensor and by turning both 
the supply and take-up motors in opposing directions at a 
low rate. If a tape is already threaded, activity will be seen 
on Ihe speed sensor. In this case, the tape does not need 
lo be automatically threaded and the servo loops can be 
closed. 

If a tape is not already threaded, the load fan (blower) 
is turned on and the supply hub is slowly rotated in a 
counterclockwise direction to center and seal Ihe reel. At 
this point, the reel encoder sensor is checked continuously 
for evidence that the reel is seating itself on Ihe hub prop- 
erly. When Ihe reel is seated correctly, all three reel encoder 
flags will interrupt the sensor beam once per revolution. 
Time interval measurements are made to ensure I hat all 
three flags (not just one or two) are present, and thai they 
have the correct relationship. If all three flags are not de- 
tected, the supply hub is shaken back and forth quickly in 
an attempt lo get Ihe reel to locate itself properly on Ihe 
hub. Areel that will nol properly seat after shaking a second 
time will be rejected and a MISLOAD will be reported. 

The reel encoder sensor and flags are also used to regulate 
the speed of the supply hub during these open-loop oper- 
ations. The dc motors must turn at relatively low speeds 
when centering a reel and feeding the tape. These speeds 
require relatively low voltages at the motors. Because of 
voltage offsets, motor constants, and temperature changes, 
the rotational speed could not be adequately controlled by 
a single voltage command. Hence, the voltage command is 
constantly adjusted based on the time measured between 
each reel encoder flag pulse. This provides rather gross, 
but sufficient control over the rotational speed of the motor 
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Fig. 3. HP 7980 A tape path showing air flow for automatically 
loading and threading a tape. 
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in its open-loop mode. 

After the reel is found to be seated properly on the hub. 
it is ready to be locked into place. First, however, the tape- 
in-path sensor is polled to determine if the tape is being 
seen continuously in the tape path. If this is the case, the 
reel has been loaded upside down. When the reel is spin- 
ning counterclockwise, the tape should only be momentar- 
ily flopping into the tape path. If the reel is found to be 
upside-down, the supply hub is turned in the clockwise 
direction until the tape clears the path. The door is then 
opened and the INVERT message is reported. 

If the reel is not inverted, the hub locking routine can 
proceed. The supply hub motion is stopped and the hub 
lock solenoid is engaged. The hub is then rotated clockwise 
by applying a voltage ramp. As the hub rotates, three lock- 
ing feet come up out of the hub and grab the tape reel, 
holding it securely to the hub. During this locking opera- 
tion, the reel encoder flags and sensor serve as a check on 
whether the lock is a success. If the reel is properly locked 
on the hub, one of the encoder flags will interrupt the 
sensor beam as the voltage ramp is applied. After a success- 
ful lock, the hub lock solenoid is disengaged and the supply 
hub is free to spin again. This locking cycle is described 
in more detail later. 

The supply hub is now rotated again in the coun- 
terclockwise direction in preparation for the inertia check. 
The inertia check is used to determine the size of the reel 
being loaded. This information is used to set up some au- 
toload and servo parameters that can be optimized accord- 
ing to the reel size. The inertia check is performed by ap- 
plying a step voltage to the supply motor at the instant a 
reel encoder flag breaks the sensor beam. The time it takes 
for the next reel encoder flag to come around and break 
the sensor beam is proportional to the inertia of the reel. 

As the hub continues spinning counterclockwise, some 
additional error conditions are checked. The check for an 
upside-down reel is repeated. A check for the tape leader 
being stuck to the reel is done. In the same manner that 
tape continuously in the tape path signals an inverted reel, 
a tape that never breaks the tape-in-palh sensor beam indi- 
cates that the leader is stuck to the reel. This can often 
occur because of static electricity or because the leader is 
jammed under the reel flange. If the tape leader is indeed 



stuck, the supply reel is spun counterclockwise at high 
speed in an attempt to free the tape end. Failure to free the 
tape leader at this point will abort the load process. 

Next, the tape is ready to be threaded. The supply hub 
continues to spin in the counterclockwise direction, and 
the tape-in-path sensor is monitored for the tape leader. 
Once the tape leader is sensed in the tape path, the supply 
reel continues to spin for another half second (to pull the 
end of the tape back to the beginning of the tape path), at 
which point it reverses direction and starts feeding the tape 
down the tape path assisted by the air flow. The tape-in- 
path sensor is checked continuously to make sure the tape 
stays in the path during the feeding operation. If the sensor 
fails to detect the tape in the path, an error condition is 
flagged and the load is aborted. 

As the threading proceeds, the speed sensor is monitored 
for activity. If the tape has been correctly fed down the 
tape path and has caught onto the take-up reel, the speed 
sensor begins to spin. The speed sensor is then used to 
calculate the amount of tape that is wrapping around the 
take-up reel. If activity is not seen at the speed sensor 
within a certain period, or the required number of wraps 
are not completed within another certain period, the tape 
will be pulled back out of the tape path by the supply reel 
and the autoload will be retried. The HP 7980A will attempt 
to load the tape in this manner five times before reporting 
failure to do so. 

After it has been determined that the tape is properly 
threaded, the load fan can be turned off and the servo loops 
closed. The take-up and supply motors are driven slowly 
in opposition to tension the tape. The tension arm position 
is monitored by an analog-to-digital converter (ADC) and. 
once it has reached the center (OV) position, the tension 
loop is closed. The tension integrator is turned on and the 
loop is given about a half second to stabilize. Next, the 
microprocessor closes the velocity loop and digitally con- 
trols its operation. The tension shutdown circuitry is en- 
abled to prevent the drive from damaging itself or the tape 
during some sort of catastrophic failure. Any failure to 
establish tension (luring this process will cause the load 
operation to be aborted and a MISLOAD reported. 

The autoload process takes 30 seconds if all goes well. 
Then the drive can be put on-line and all normal reading 
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Fig. 4. Cross section of supply 
hub. The loot is rotated upward 
and outward from the supply hub 
by holding the hub fixed and rotat- 
ing the sleeve underneath. This 
causes the dumbbell-shaped tie 
rod to move from a slanted posi- 
tion to a position more upright, 
thus pushing the foot up and out. 
Reel encoder Hags pushed down 
by the presence of a reel are 
sensed by the optical sensor to 
verily proper reel loading. 
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and writing operations can be performed. 

Hub Lock Mechanism 

The first major objective in the autoload process is to 
determine the presence of, seat, and secure reels of tape. 
This process is complicated by the availability of four dif- 
ferent sizes of half-inch-tape reels ranging in diameter from 
six inches to 10.5 inches, the latter being the most common 
size. The tape reel is secured onto a supply hub which 
suspends the reel at the proper height, holds it secure, and 
rotates the reel. 

When the door of the tape drive is closed, the autoload 
algorithm starts immediately. The supply hub rotates in a 
clockwise direction and an off-center protrusion atop the 
hub contacts the inner race of the reel of tape. The rotational 
motion drags the reel in an inward spiral pattern until the 
reel eventually drops onto the hub's mounting plane. Three 
small plastic springs (the reel encoder flags mentioned ear- 
lier) mounted around the mounting plane are pushed down 
by the weight of the reel and trip the reel encoder optical 
sensor. The use of three springs assures planar contact of 
the reel on the hub, which helps prevent tape edge damage 
by keeping the reel as parallel as possible to the tape path. 
A fourth spring may be deflected by the write enable ring 
on the reel of tape to relay that status as well. 

At this point, the reel is secured to the hub by means of 
three feet that rotate up and out of the hub to clamp the 
inner race of the reel. The feet are driven by a four-bar 
linkage that translates the rotational motion of the hub into 
a vertical rotation of the feet. At the end of their travel, the 
feet are locked into position in much the same manner as 
a toggle switch with an overcenter design. 

The mechanism that drives the feet up and into place 
has four basic parts (Fig. 4). The first is the foot itself, 
which pivots about an axle swaged into the hub base. The 
foot is driven by a barbell-shaped tie rod which translates 
the horizontal rotation of the hub into the vertical rotation 
of the foot. The barbell, in turn, is driven by a sleeve-shaped 
part that fits around the base of the hub and is held in 
place by a spring and retaining ring. In normal operation, 
the sleeve rotates with the hub. During the reel locking 
stage, a solenoid engages an arm that stops the motion of 
the sleeve, and while the hub is rotated, causes the sleeve 
to rotate relative to the hub. The foot is pushed up and 
into place, depressing the wave spring when the foot con- 
tacts the tape reel. As the hub rotation continues, the barbell 
pops over center and starts to retreat. The relative motion 
at this point is ended by stops in the hub that limit the 
motion of the sleeve. The lock cycle is double-checked by 
the reel encoder optical sensors. Sensor placement is such 
that one of the plastic springs always interrupts the sensor 
at the moment the sleeve hits its stops. A neoprene pad 
embedded in the surface of the foot ensures that the reel 
does not slip relative to the hub. The hub is unlocked by 
simply reversing the process. In the case of a power failure, 
the entire lock or unlock cycle can be performed by man- 
ually turning the reel and engaging the arm. 

Many obstacles had to be overcome to ensure a successful 
design. The first of these was materials selection. The ma- 
terial for the plastic springs must have a high endurance 
limit, low creep at moderately high temperatures, and a 



consistent and high spring constant. The material chosen 
was ULTEM 1000 by General Electric. Material for the slid- 
ing parts on the hub was tailored to minimize wear and 
friction, and to resist deformation under load. The second 
obstacle was cost. We made extensive use of design for 
manufacturability to decrease part count and minimize as- 
sembly time. The result is a hub with no screws thai is 
completely assembled from one side. 

Integrated Tape Path 

All the tape path components, except for the reel motors, 
are mounted onto a precision head plate which is mounted 
on the deck casting. The magnetic tape head and the tape 
cleaner are permanently mounted onto the head plate in a 
manner similar to that used for the earlier HP 7978A Tape 
Drive." 1 This greatly reduces the number of tight tolerances 
required on the deck casting because of tape path require- 
ments. The buffer arm and speed sensor assemblies are 
removable. This allows for easy replacement of the head 
plate, speed sensor, and buffer arm assemblies. Each of 
these assemblies is designed to be modular and inter- 
changeable. There are no service adjustments on the HP 
7980A because any adjustments required are done at the 
factory. No routine maintenance is required, except for 
regular cleaning of the tape path. 

The tape path components perform several tasks in guid- 
ing the tape over the magnetic tape head. They make sure 
the tape is wrapped around the head properly and consis- 
tently. They also guide the tape past the head at a precise 
angle (skew) and at a proper height (tracking) so that the 
written tape is interchangeable with other tape drives. Nor- 
mally, there are two precision stationary or fixed guides 
that perform these functions. The HP 7980A tape path in- 
corporates one precision guide into the buffer arm assembly 
and the other guide is integrated with the speed sensor 
roller. 

Speed Sensor 

The speed sensor integrates the functions of an optical 



Upper 
Flange 

Roller 




Spring 



Optical ^ 
Encoder 



-Code Wheel 



Fig. 5. Cross section ol speed sensor 



40 HEWLETT-PACKARD JOURNAL JUNE 1988 



© Copr. 1949-1998 Hewlett-Packard Co. 



encoder and a precision tape guide. The particular chal- 
lenge was to create a rolling element, needed for transmit- 
ting tape speed to the encoder, that would guide the tape 
with the accuracy of a fixed guide. 

Fundamentally, a rolling guide is simply a vertical cylin- 
der with flanges on the top and bottom. The upper and 
lower flanges restrict the vertical limits of the tape's travel. 
According to the ANSI standard, the width of a half-inch 
tape must be between 0.496 and 0.500 inch. Clearly, the 
distance between the roller flanges must be at least 0.500 
inch not to damage maximum-width tapes, but this allows 
a 0.496-inch-wide tape to wander 0.004 inch between 
flanges. However, in the simplified tape path the speed 
sensor must also guide the tape for proper skew and track- 
ing across the head. This makes 0,004 inch of vertical wan- 
der unacceptable. 

A traditional solution of this problem is to put a fixed 
guide between the roller and the head. A fixed guide is a 
nonrotating cylinder with a top flange set to a precise 
height. A spring-loaded washer pushes the bottom edge of 
the tape so that the top reference edge of the tape stays in 
contact with the top flange. This eliminates tape wander 
since the distance between the washer and the top flange 
can vary with tape width. 

The HP 7980A speed sensor combines features of both 
rolling and fixed guides. Fig. 5 shows a cross section of 
the speed sensor. The roller, which includes a top flange, 
spins with the tape and transmits tape speed (through the 
shaft) to the optical encoder. The roller has no bottom 
flange, but there is a spring-loaded ceramic washer below 
the roller. The washer is restrained from rotating, but can 
move vertically to push the tape against the fixed height 
of the roller's upper flange. Thus, the roller is fixed verti- 
cally, but moves rotationally, while the washer is fixed 
rotationally and moves vertically. 

To keep the bottom of the spinning roller from rubbing 
on the non-spinning washer, the roller width is slightly 
smaller than the minimum tape width. Thus, the semicir- 
cular arc of tape wrapped around the roller pushes the 
ceramic washer down and away from the roller. The spring 
is placed under the center of gravity of this semicircular 
arc of tape. This balances the forces on the washer so that 
it remains horizontal and does not damage the bottom edge 
of the tape. 

Buffer Arm Assembly 

The design of the buffer arm assembly presented several 



special challenges. The buffer arm assembly must provide 
tape buffering, proper tape tension, a servo position signal, 
proper tape guidance, and an overtension shutdown signal. 
The buffer must be lightweight enough to maintain servo 
bandwidth, but strong enough to hold the tape height 
within a tenth of a millimeter. Because the HP 7980A is 
an autoloading drive, the buffer arm must also act as an 
air dam to load the tape correctly. To achieve these require- 
ments, a thin-wall aluminum die-cast part is used. Us flex- 
ibility allows an air baffle, a spring post, a stop, and a slot 
for overtension shutdown to be incorporated into a single 
part. 

During testing, it was discovered that the tape resonant 
frequency dropped when the drive repositioned frequently. 
It was determined that during such reposition cycles more 
air became entrapped in the tape stack. As a result, the 
effective tape length was increased, decreasing its spring 
constant. To overcome this problem, a Coulomb damper 
is used. A cantilever spring is placed between the buffer 
arm and fixed guide so that it produces approximately four 
inch-ounces of frictional torque. This dissipates enough 
energy from the system to allow servo stability without 
affecting other parameters. 

In a typical tape drive, the tape, after leaving the buffer 
assembly, enters a fixed guide for proper skew alignment. 
To minimize space requirements, simplify service, and re- 
duce parts count, the fixed guide is combined with the 
buffer assembly. The buffer arm pivots about the center of 
the fixed guide as can be seen in Fig. 6. The fixed guide, 
made of a stainless-steel ring and two ceramic washers, is 
bonded on the buffer base to a 0.015-mm tolerance. This 
allows the buffer assembly to provide all the tape guidance 
in the front half of the tape path, but still be removed by 
loosening only three screws. 

Another feature is the way the position of the buffer arm 
is sensed. The sensing assembly had to be small enough 
to pass through a 40-mm-diameter hole in the head plate, 
but require no adjustment if the buffer is replaced. To 
satisfy these requirements, a small ceramic magnet is 
mounted on the buffer shaft. Changes in the rotating mag- 
netic field are sensed by a linear Hall-effect IC. The plastic 
magnet holder allows each magnet to be bonded in its 
correct calibrated position. With this arrangement a signal 
that is linear within ±8% over the buffer arm range is 
achieved, requiring no further calibration for interchange. 
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Design (or Manufacturability 

The HP 7980A tape drive is designed to be built easily. 
The major improvement over the previous HP 7978B drive 
is its size. The HP 7980A is about a third the size and half 
the weight of the HP 797813. This makes it much easier to 
build and handle. 

All of the major components are attached to the main 
deck casting. The main deck casting is moved down the 
manufacturing line as major components and assemblies 
are attached to it. It is then flipped over to add components 
onto the top and tested. Most of the components and as- 
semblies are attached to the casting in a downward move- 
ment and fastened with a common self-lapping screw. 

Reduction of parts is another major improvement. The 
number of mechanical parts was reduced from approxi- 
mately 610 on the HP 7978B to 340 on the HP 7980 A. The 
number of unique mechanical parts was reduced from ap- 
proximately 260 to 140. All these factors help substantially 
reduce the time to build the HP 7980A. It is, therefore, a 
much more manufacturable machine than its predecessor, 
the HP 7978B. 
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Streaming Tape Drive Control Electronics 
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■ N DEVELOPING THE HP 7980A Tape Drive, the design 
I team leveraged its expertise gained from the develop- 
I ment (if previous streaming tape drives. Design goals 
included a low factory cost, a small form factor, an auto- 
loading capability, and a selection of interfaces. To achieve 
these goals, advancements were made in many key areas 
of tape drive design. These areas included the drive archi- 
tecture, controller software, read/write electronics, tape 
servo system, and front panel. 

Three-Box Control Architecture 

The HP 7980A Tape Drive incorporates a control architec- 
ture based on the three major functional blocks of a stream- 
ing tape drive: interface, buffer, and drive. Each functional 
block contains a microprocessor-based controller and data 
path electronics to handle user data. Fig. 1 shows the three- 
box architecture of the HP 7980 A. 

Interface. The interface block handles all communications 
with the host computer. The HP 7980A contains an HP-IB 
(IEEE 488) interface. Currently, two other interfaces (PfclR- 
TEC and SCSI) have been developed. Drives with one or 
the other of these interfaces are available as the HP 8878UA. 
Each interface accepts tape commands from the host, trans- 
fers the user data, and returns the appropriate status. To 
perform a task, the interface first relays the tape command 
to the buffer controller. User data is then sent to or obtained 
from the buffer memory. Upon completion of the task, 
status is obtained from the buffer controller. 
Buffer. The buffer block is responsible for streaming perfor- 
mance. The buffer controller oversees the use of Ihe 512K- 
byte buffer memory. The buffer accepts tape commands 
from the interface and lorwards them to the drive control- 
ler. The controller transfers user data between the interface, 
the buffer memory, and the drive's formatter. Status is ob- 
tained from the drive controller and forwarded back to the 
interface. Streaming performance is maintained through 
the use of immediate response on write operations and 
read-ahead on read operations. Immediate response on 
writes is implemented by the return of status as soon as a 



full record of data is accepted into the buffer, but before it 
is written onto the tape. This allows the host to free its 
memory and begin transferring the next record. When a 
write error occurs on the tape, the buffer will perform the 
necessary write retries independently of the interface and 
host computer. Read-ahead on reads is implemented by 
issuing additional read commands to the drive so that 
streaming is maintained until either the buffer is full, a 
command other than a read is received from the interface, 
or a read error occurs. This frees the host and interface 
from the constraint of performing all command and status 
processing overhead within the interblock gap time. Over- 
head processing time can be averaged out over several rec- 
ords. The buffer also performs all necessary read retries 
independently of the interface and host computer. Incorrect 
data is not sent to the host. 

Drive. The drive block executes all tape commands. The 
drive controller oversees the servo system, motor drive, 
tape path, formatter, read/write, and front panel. The drive 
controller accepts tape commands from the buffer control- 
ler and executes them. User read data is read from Ihe tape, 
corrected and checked by the formatter, and then sent to 
the buffer memory. User write data is accepted from the 
buffer memory, encoded by Ihe formatter, and then written 
lo the tape. Read-after-write occurs on all write commands 
to ensure error-free writes. Status is sent to the buffer con- 
troller. In executing a tape command, the drive controller 
must control tape speed, tension, and position. The drive 
controller relies upon the servo system eleclronics for 
closed-loop control of tape speed and tension. Tape posi- 
tioning is performed in conjunction with the read elec- 
tronics and formatter and is based on block boundaries. 
The drive controller also manages user requests and infor- 
mation through the front pane]. User requests requiring 
buffer and/or interface involvement are passed to these 
functional blocks. 

Each functional block is connected by a common com- 
munication link (CCL) to an adjacent functional block. The 
CCL encompasses common hardware and common soft- 
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ware definitions. All of the optional interfaces communi- 
cate with the buffer using the same CCL. The buffer com- 
municates with the drive using a similar CCL. The CCL 
software definition will be discussed later. 

The CCL hardware uses a message bus for the transfer 
of commands and status, and a data transfer bus for the 
transfer of user data to and from the tape. The message bus 
interfaces the controller of a master functional block (closer 
to the host) to a dual-port RAVI located on the controller 
of a subordinate functional block (closer to the drive). Fig. 
1 shows the location of the dual-port RAM for each CCL. 
This dual-port RAM is directly and fully addressable by 
the microprocessors of both controllers. Commands and 
status are passed between the controllers by writing and 
reading specific memory locations. The data transfer bus 
interfaces the data path of a master functional block to 
FIFO (first-in, first-out) buffers located on the data path 
of a subordinate functional block. Separate inbound and 
outbound FIFOs are used to facilitate loopback testing. User 
data is passed along the data transfer bus in a sequential 
manner. Odd parity is maintained throughout with parity 
checking occurring as data is received across a functional 
boundary. An end-of-data bit is asserted to flag the last 
byte of each data record. 

Interfaces 

Important to the tape drive's acceptance by OEM cus- 
tomers (as the HP 88780A) is a selection of interfaces. The 
drive currently supports HP-IB, SCSI, and PERTEC. In the 
development of these interfaces, common hardware and 
software designs were shared. All three interfaces contain 
a 6809 microprocessor-based controller. About 30 percent 
of the circuitry is common to all three interfaces. As men- 
tioned above, a common communication link definition is 
used to connect an interface to the buffer. Each interface 
completely isolates the rest of the drive from host, interface, 
and protocol specific requirements. By developing these 
interfaces in parallel it became clear what functionality 
belonged in the interface as opposed to the buffer or drive 
functional blocks. 

The development of three sets of interface code in the 
time allotted for one was a major undertaking. There are 
24K bytes of HP-IB code, 40K bytes of SCSI code, and 17K 
bytes of PERTEC code. Design time was reduced through 
the use of a common firmware architecture and the leverag- 
ing of common code. These interfaces share the same top- 
level firmware structure. About 6K bytes of code is common 
among the interfaces. The common code contains dual-port 
RAM communication routines and command processing 
of front-panel requests including diagnostics. With the ex- 
ception of a power-on routine and a table-access module, 
all of the code is written in high-level languages. The use 
of high-level languages increased design efficiency and 
code readability. Overhead processing time became an 
issue with the PERTEC interface. Instead of rewriting the 
code in assembly language, overhead time was removed 
by rearranging the order of tasks during reads and writes. 
An increase in parallelism between interface and buffer 
was obtained, saving about two milliseconds per command. 

The three-box architecture gives the HP 7980A and HP 
88780A designers flexibility. Not only can new interfaces 



be easily designed and supported, but also options and 
enhancements can be added easily. For example, a new 
buffer design can be integrated without affecting any of the 
interfaces or the drive. Parallel development, debug, and 
test of the functional boxes increased design efficiency by 
minimizing the impact that problems in any one area had 
on the entire project. Because these boxes are functional 
entities, they could be turned on and debugged separately. 
Separate turn-on and debug included functional testing as 
well as the testing of communications with neighboring 
blocks. The integration of the first complete HP 7980A 
produced a working drive within a month. 

Common Communication Link Software 

The common communication link (CCL) is a single inter- 
face definition for two separate interface links. The HP 
7980A contains three processors, a host interface control- 
ler, a buffer controller, and a device controller, performing 
specific tasks. The main function of the buffer controller 
is to intercept read and write commands and buffer the 
data within its data buffer to provide fast data access. For 
operations not involving data buffering or command queuc- 
ing, the buffer controller typically passes the operation 
(command) directly through. By having common interface 
links on both the interface and the drive, the complexity 
of executing unbuffered commands is greatly reduced. 

The CCL is designed to have a common command set 
for the half-inch tape format. Although the HP-IB interface 
is the primary host interface for the HP 7980A, the CCL 
command set does not preclude support of other interfaces 
such as SCSI, PERTEC, or a future high-performance inter- 
face. The buffer controller and drive controller implement 
a superset of commands that represents the operations 
needed to support most half-inch tape interface protocols. 

CCL protocol is implemented by a lK-byte dual-port 
RAM. This dual-port RAM is accessible by both the master 
(closer to the host CPU) and the subordinate (closer to the 
drive). Access is controlled by message control locations 
which synchronize the passage of messages across the CCL. 



Dual-Port RAM Format 
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Fig. 2. Dual-port RAM use. 
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The dual-port RAM is divided into two command ports, 
a diagnostic message port, a status area, two interrupt con- 
trols, and two interrupt locations (see Fig. 2). The command 
ports support bidirectional command transfers. A master- 
initiated command port passes commands from the master 
to the subordinate, while the subordinate-initiated com- 
mand port passes commands from the subordinate to the 
master. The master-initiated command port typically exe- 
cutes commands generated by the host, while the subordi- 
nate-initiated command port typically executes commands 
generated from the front panel. 

A command port contains one of two message types: a 
command sent from the initiator or the report returned by 
the target. The contents of the command port are deter- 
mined by a message control location. 

The diagnostic message area is used during certain diag- 
nostic operations when data messages larger than the com- 
mand ports (64 bytes) need to be passed. A diagnostic mes- 
sage can be up to 512 bytes long, followed by a two-byte 
checksum. 

The status area is used to maintain information un the 
drive that changes infrequently. This area contains infor- 
mation such as tape loaded, write protected tape, and the 
density of the current tape. 

Thfi interrupt control locations are used to specify areas 
of the dual-port RAM that require that an interrupt be sent 
notifying the other controller of any change in their con- 
tents. Interrupts are most often requested on changes to 
the command port message controls (indicating that a com- 
mand is ready or a report has been returned) and changes 
to the status area. The use of programmable interrupts gives 
more freedom for each controller's implementation. The 
interrupt locations allow the master and subordinate to 
interrupt each other independently. The interrupt location 
is written with a value that indicates the area of dual-port 
RAM that has been changed. 

The CCL commands are divided into six major opcode 
ranges: write, read, space, general, configuration, and diag- 
nostic commands. Each command range is further sub- 
divided into the individual commands supported within 
that group. For example, the write command group consists 
of write data record, write file mark, write gap, and write 
density ID. The opcode ranges simplify command decod- 
ing. Parameters on CCL commands are dependent on the 
opcode and vary in size from none to 57 bytes. For most 
opcodes, the number of parameter bytes is fixed. For ex- 
ample, the write record request opcode contains one three- 
byte parameter specifying the maximum length of the data 
to be sent, the write tape mark opcode contains no param- 
eters, and the write density ID contains a one-byte param- 
eter specifying the density to be written. 

The report type returned on a CCL command depends 
on the command sent. All tape commands return a normal 
report containing status flags, tape position flags, a byte 
count field, a retry count, and an error code. The status 
flags quickly tell whether the command was successful, 
successful with error recovery, unsuccessful, or rejected. 
The tape position flags indicate the current position with 
regard to beginning of tape (BOT), end of tape (EOT), and 
end of file (EOF). The byte count contains the length of a 
write record or read record. The retry count indicates 



whether physical retries were needed during the operation, 
and the error code contains detail on a command failure 
or command rejection. 

Non-tape commands return report types specific to their 
operation. For example, the diagnostic test commands re- 
turn a diagnostic report that can contain the test number 
that failed, the failure code, and up to two field-replaceable 
units (FRTJs) that could have caused the failure. 

High-Performance Buffer Software 

The HP 7980A is a high-performance streaming tape 
drive. It is also very flexible in its ability to be adapted to 
different environments and to use performance features in 
a variety of ways. High performance and flexibility of use 
in the HP 7980A are achieved by the features provided 
within the data buffer. 

A streaming tape drive mechanism such as the drive 
controller requires a constant stream of data if it is to main- 
tain high performance. In a typical host-drive interaction, 
irregular or slow data rates can interrupt the constant 
stream of data and cause the tape to stop and perform a 
costly repositioning cycle. The data buffer addresses the 
data rate problems by providing an asynchronous link be- 
tween the interface and drive, allowing each to operate 
independently at its own data and command rates for lim- 
ited time intervals. The HP 7980A has a large buffer in 
which to hold data as data rates are matched, making the 
asynchronous link possible. The buffer holds 512K bytes 
of data, enough not only to match variations in data rates, 
but also to allow the data buffer subsystem to accept data 
throughout a repositioning cycle at an average rate of 400 
kilobytes/second. 

The data buffer packs variable-sized records into its buf- 
fer and places an entry in a queue indicating the location 
and size of the corresponding record within the data buffer. 
The queue is large enough to allow the buffer to be com- 
pletely filled without reaching queue limitations except on 
small records. 

The buffer maintains two independent processes, one 
that interacts with the drive and another that interacts with 
the interface. Any transactions the host computer makes 
with the HP 7980A Tape Drive are made with the interface 
side of the data buffer. The host transaction is complete 
when data has passed between the host and the buffer and 
necessary reporting and status information has passed. The 
operations necessary to write or read the data from the tape 
occur independently of the host interactions. 

During write operations data passes into the buffer from 
the host. As soon as data is in the buffer, the command is 
reported on with an immediate response report indicating 
that the operation is complete. The queue contains the size 
of each record. Data held in the buffer is then written to 
the tape automatically when a sufficient amount of data 
has been accumulated or when an appropriate time-out 
has occurred. The drive is streamed as long as data is avail- 
able in the buffer. 

When a read command is received from the host, the 
buffer goes into a read-ahead mode during which the drive 
side of the buffer controller issues continuous reads to the 
drive controller. Data passes into the buffer and reports are 
placed into the queue. This time they contain not only 
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record size, bul also information regarding the success of 
the read attempt. Data is taken from the buffer and reports 
are taken from the queue to satisfy all read commands from 
the host. As data is removed by the interface, the drive tries 
to keep the buffer full, managing tape motion as necessary. 

The queuing/buffering algorithm implemented in the HP 
7980A allows a great deal of flexibility with the contents 
of the buffer. When data is read from the buffer, pointers 
are moved forward in both the data buffer and queue. Until 
the information is physically overwritten, it is possible to 
recover the information by moving the poinlers back again. 
This capability is built into the read-ahead functions of the 
buffer controller. If a backspace command is received while 
the drive has been performing read-aheads. the buffer first 
attempts to recover the data by moving pointers back. This 
electronic backspace, as it is termed, gives a great perfor- 
mance advantage in that it eliminates the need to reverse 
the tape physically to position back one record. 
Increased Buffer Flexibility. The data buffer has other fea- 
tures that provide greater flexibility in the use of its perfor- 
mance capabilities. Two of these features are greater access 
to buffer functions and the ability to change key performance 
parameters. Flexibility is also obtained through the use of 
manual buffer operations and through configurability. 

Buffer operations described to this point involve two 
sides oi the buffer. The drive side performs either delayed 
or anticipatory actions automatically to provide quick re- 
sponse to the host. Data is disposed of according to conven- 
tions in streaming conditions. 

Manual commands offered by the buffer provide the same 
capabilities that occur automatically during a read or write 
operation, with each operation broken down into subopera- 
tions. During manual operations, only the specified sub- 
operations are performed, with no automatic actions taken 
and with data disposition specified, not assumed. For 
example, a single write command is broken down into 
write-to-buffer. write-to-tape, and remove-record-from- 
buffer suboperations. ManuaJ buffer operations are much 
more cumbersome to use and are not optimized for high- 
performance streaming, but offer flexibility in certain key 
areas such as diagnostics and in recovering buffered data. 

Diagnostics make heavy use of manual commands. Data 
loopback operations are performed by writing to the buffer 
and then reading data back to the interface. The drive pro- 
vides wellness and error rate tests using manual commands 
to write and read tapes locally. Diagnostic access to manual 
commands also allows access to buffer and tape operations 
from the front panel. 

Recovering buffered data also uses manual commands. 
When a hard error occurs, or if the tape is stopped at EOT 
with unwritten data, the data in the buffer can be recovered 
by issuing read-from-buffer commands. Data recovery is 
not a typical operation, but is possible using manual com- 
mands. The flexibility of manual commands also allows 
current and future interfaces to implement interface spe- 
cific operations. 

The performance of the drive is dependent not only on 
the large buffer, but also on how the drive is set up to use 
the buffer. For example, parameters such as the trip point 
at which the drive starts writing data out of the buffer to 
the tape, the maximum record size to be written, or the 



length of time writes are held can all affect performance. 

The HP 7980A is a highly configurable drive. Most vari- 
ables that can affect performance and many that determine 
the personality of the drive can be configured and main- 
tained within nonvolatile RAM on the buffer board. Config- 
urations are used by the drive controller, the buffer control- 
ler, and the interface controller. A section of configurations 
is maintained and defined differently for each interface. 

The configuration values are distributed to the appro- 
priate subsystems when the drive is powered up. The inter- 
face also maintains default ROM configuration values. In 
the event that battery backup power is removed from the 
nonvolatile memory, default values are obtained from the 
interface. 

Through configurations, a drive can be set up with a 
distinct personality and set of performance features for a 
particular user. The configuration system also maintains a 
set of locks which can be used to lock the configuration to 
the current value and prevent it from being changed. 
Concurrent State Machines. The data buffer holds a special 
position within the drive in that it maintains CCL com- 
munications with four ports, three of them entirely inde- 
pendent. Each port has its own characteristics and needs. 

The downstream interface port is the source of all host 
commands. Commands received here must be processed 
quickly since all streaming operations into and out of the 
buffer occur here. Streaming commands also have the 
characteristic that when one is received, the next command 
will probably be the same. 

All streaming commands pass to the drive through the 
downstream device port. Quick response at this port is 
critical. If a command is not sent to the drive when a report 
is received, the drive controller may be forced to stop 
streaming and reposition the tape. 

The upstream drive and interface ports are not indepen- 
dent since there is no queuing of upstream commands. 
Commands received from the drive are either fully or par- 
tially processed by the buffer controller. If necessary, they 
are then passed up to the interface controller. Commands 
received from the drive are irregular and are typically ini- 
tiated as a result of human interactions. Hence, processing 
speed only needs to be fast enough for human response. 

Resets can also be received as asynchronous events from 
either the drive or the interface. Resets are infrequent and. 
like upstream commands, do not require fast processing 
response like downstream commands. 

The processing needs of the different ports led to an 
architecture within the buffer controller that allows 
maximum independence and specialization for the pro- 
cessing of each of the ports. Each major input source is 
handled by a separate program which controls the source 
and can be tailored to meet the streaming or background 
needs of each port. An operating system provides a set of 
concurrent processes for the programs to run within. Each 
program runs independently within its own time slice, 
with time apportioned for critical needs as they arise. 

The programs are implemented as state machines and 
the operating system as a concurrent state machine driver. 
State machines have the characteristic that they perform 
an action when a particular event occurs, then proceed to 
the next state and wait for another event to occur. State 
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machines can be controlled effectively by an operating sys- 
tem. The operating system performs state transitions and 
calls to state execution modules. It also allocates time 
appropriately between the different processes during the 
waiting periods. Within the time slice given to each pro- 
cess, each of the potential events for the current state can 
be checked. If an event has occurred, the associated action 
for that event is also taken. Actions are coded as straight- 
line operations without loops. At the conclusion of an action, 
processing returns to the operating system for state transi- 
tions and for other processes. 

The interface slate machine handles all communication 
between the downstream interface port and the queue and 
buffer hardware. The drive state machine handles down- 
stream communication between queue and drive port, in- 
cluding all automatic retries of tape operations. The subor- 
dinate state machine handles upstream commands in both 
the interface and drive ports. The reset state machine han- 
dles all resets regardless of location. 

The operating system has no watchdog timer nor the 
associated timing and communication race conditions as- 
sociated with unexpectedly losing the processor to another 
time slice. Instead, the operating system allows each pro- 
cess to complete its action before passing control to the 
next process. Time for the heavy needs of the interface 
state machine and the drive state machine is automatically 
allocated by allowing the processes to take the time that 
they need. Processes such as the subordinate state machine 
and the reset state machine are effectively turned off by 
having only a single event to check on. They have no com- 



plexity until the event occurs. 

Each of the processes is independent, but each can have 
communication with other processes and affect their oper- 
ation by passing messages or commands. All communica- 
tion between processes occurs as events in the receiving 
process caused by actions in the sending process. Com- 
munication locations are tightly controlled data structures, 
the queue being a prime example. The receiving process 
looks at the communication locations for conditions that 
trigger events in themselves and that require action to be 
taken. 

The buffer controller functions are divided into indi- 
vidual programs, which are independent, have their own 
specialization and definition of tasks, and have localized 
access to hardware, ports, and internal communication lo- 
cations. This provides the structure needed to tackle the 
complex tasks of the buffer controller. Each of the pieces 
has defined functions and boundaries forgood process defi- 
nition and implementation. 

Integrated Read Write System 

The design objective for the HP 7980A read/write system 
was to lower cost while improving performance over pre- 
vious tape products. This objective was met through the 
reduction of printed circuit board space, the real-time op- 
timization of read gains, the calibration of each read chan- 
nel, and the simplification of the write electronics. 

The HP 7980A read/write board replaces the equivalent 
functions of earlier read, write, and write formatter boards. 
This reduction in printed circuit board space was achieved 
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by using surface-mount components, combining the pre- 
amplification and differentiation functions into one stage, 
using an integrated disc pulse detector chip, and develop- 
ing a semicustom phase-locked loop chip. 

The read channel shown in Fig. 3 consists of a differen- 
tiating preamplifier, a gain-controlled amplifier, an auto- 
matic gain control circuit, a gap clamp circuit, a zero-cross 
pulse circuit, a zero-cross qualifier circuit, a calibration 
circuit, and a fourth-order Bessel filter. The input to each 
channel is the output of a single-track read coil. Two sig- 
nals, a zero-cross pulse ZCP and a zero-cross qualifier ZCQ, 
make up the output of each read channel. The zero-cross 
pulse is generated each time the differentiated input signal 
passes through zero. The zero-cross qualifier goes to a log- 
ical zero when the differentiated input signal becomes more 
negative than the lower hysteresis level and goes to a logical 
one when the differentiated input signal becomes more 
positive than the upper hysteresis level. This information 
is used by the phase-locked loop to recreate the digital data 
as it was recorded. 

Differentiating Preamplifier. The preamplifier is an NE592 
differential video amplifier configured as a differentiator. 
The 200-kHz (PE format) gain is set for 32 dB with a phase 
shift of 88 degrees, and the 555-kHz (GCR format) gain is 
set for 40 dB with a phase shift of 86 degrees. 

A typical read channel has a preamplifier stage followed 
by an AGC amplifier and differentiating amplifier. To re- 
duce board space and parts count, the preamplifier and 
differentiating amplifier are combined. The layout of the 
differentiating preamplifier is very critical to achieving 
high common-mode rejection and retaining a high signal- 
to-noise ratio. Fig. 4 shows the difference between the con- 
stant gain and the differentiating configurations for the 
NE592. In the differentiating configuration, any leakage 
capacitance from the junction of capacitor Cl and resistor 
R1 results in the two emitters seeing different impedances 
to ground. This unbalances the differential front end of the 
NE592, resulting in a decrease in common-mode rejection. 
The board layout techniques shown in Fig. 5 are used to 
reduce the noise level and retain the common-mode rejec- 
tion of the NE592. The read and write sections have board- 
level isolation of 5V. ±12V. and ground. Each of the read- 
channel NE592 front ends has individual paths to the bot- 
tom ground plane, individual lines for power, and mini- 
mum trace runs near the differentiating preamplifiers. The 
head signals from the J2 connector to the NE592 inputs 
alternate between layers 2 and 3 which are located between 
ground plane layers 1 and 4. 

Disc Pulse Detector Chip. The preamplifier is ac coupled 
to the AGC amplifier, which drives a differentia] input, 
fourth-order, passive Bessel filter. The output of this filter 
drives the AGC, zero-cross pulse, and zero-cross qualifier 
circuits. The AGC circuit measures the input signal and 
compares it against the AGC level (an external dc voltage). 
This voltage difference controls the gain of the AGC 
amplifier to make the peak-to-peak differential voltage out- 
put of the Bessel filter equal to four times that of the AGC 
level. The differentiating preamplifier turns peaks into 
zero-cross data so that a comparator can be used to locate 
the read signal peaks. If the read signal exhibits a tendency 
to return to the baseline between peaks, the comparator 



could respond to noise near the baseline. To avoid this 
problem, a zero-cross qualifier circuit is used. This circuit 
is a comparator with externally controlled hysteresis. The 
differential signal must go above or below the dc hysteresis 
value before the comparator will switch states. The com- 
parator output is fed to the phase-locked loop as zero-cross 
qualified data. The output of a bidirectional one-shot is 
fed to the phase-locked loop as zero-cross data that indi- 
cates the start of a data window. The phase-locked loop 
uses the zero-cross data to generate a pulse that is half a 
bit window long. This pulse is used to clock the zero-cross 
qualified data into a latch. The output of this latch repre- 
sents the data as it was put on the tape. 
Automatic Calibration. During track activity (data records), 
each track independently adjusts its gain via the AGC volt- 
age to an optimum amplitude. The AGC voltage required 
by each channel while reading data is determined using 
comparator Ul and DAC U2 as shown in Fig. 3. During 
gap conditions (no track activity), the AGC voltage is over- 
ridden by individual gain control DACs. This allows each 
channel to be preset to a gain that produces an output 
signal of optimum amplitude if a nominal input signal is 
present. Detection of data by the phase-locked loop signals 
the channel to enter the AGC mode until another gap is 
detected or the gain increases above a predetermined value. 
DAC U2 and comparator Ul in Fig. 3 are used to determine 
when the gain is above the predetermined value. During 
writes the gain is also held constant so that low-amplitude 
read signals cause a write retry. 

Track-to-track variation in the AGC voltage is dependent 
on the head and read channel combination and is indepen- 
dent of tape-to-tape variation. Track-lo-track variation (gain 
profile) from a high-output tape to a low-output tape can 
be closely modeled using a single gain offset value across 
all tracks. Gain profiles of GCR and PE densities differ, and 
therefore require separate gain profiles for each density. 
Automatic calibration is performed at the beginning of each 
tape when the tape density is being identified or being 
written. This calibration in GCR involves all nine tracks 
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Fig. 4. Constant gain (left) and dv/dt (right) configurations 
tor NE592 preamplifier 
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during the ARA burst portion of the GCR density ID. while 
in PE it involves only the PE density identification track. 
Reference DAC 112 in Fig. 3 is stepped upwards until com- 
parator Vi indicates the reference voltage is equal to the 
AGC voltage for this track. This procedure is repeated two 
more times and the median voltage for each track is saved 
as one of the raw gain voltage levels for the current tape. 
The average gain of the raw gain voltage levels is calculated 
and saved as one of the average gain voltage levels for the 
current tape. A gain profile calibration is performed using 
the tape drive diagnostics. An all-ones pattern at the appro- 
priate density is written for 100 feet beginning at BOT. The 
pattern is read and the gain voltage for each track is deter- 
mined 256 times. The average gain voltage for each track 
is the gain profile for the specific density. Each time the 
average gain voltage level is calculated during a tape ID 
routine, the gain profile for the specific density is nor- 
malized such that its average value across all nine tracks 
matches the average gain voltage level for the current tape. 
Standard Cell Phase-Locked Loop. When we decided to do 
a follow-on design to the HP 7978 A Tape Drive, our experi- 
ence with its design afforded us a great opportunity to 
make technological advancements in the areas of price, 
performance, and reliability. With this in mind, we selected 
areas of the HP 7978A design where we could get the best 
return using some sort of LSI integration. One area was in 



the clock recovery section of the HP 7978A. 

Designing a multidensity phase-locked loop for a nine- 
track tape drive presents a problem when one is trying to 
minimize cost and maximize performance. However, with 
this as a design criterion, it was evident that some sort of 
integration was needed. What we decided on was a digital' 
analog hybrid phase-locked loop design. This gave us the 
flexibility to set important third-order loop parameters 
using normal analog loop parts, that is. an op amp. filter, 
and VCO. We then wanted to use LSI technology to inte- 
grate all the digital logic associated with the recovery 
scheme. We chose HP's own CMOS standard cell process 
whose density allowed us to put three tracks of phase- 
locked loop logic into one chip, and thus use only three 
chips per system. 

The major elements of the standard cell are shown in 
Fig. 6. Two programmable digital delay lines are used in 
the multidensity design, one for qualification of incoming 
read channel flux data, the other for calculating the phase 
error to be used by the phase detector. A pattern detector 
is used to detect the presence of data to control the phase- 
locked loop as well as the automatic gain control circuitry 
of the read channel. A two-state phase detector that can 
switch between frequency-lock and phase-lock modes vir- 
tually eliminates the chances of a harmonic lockup. An 
output encoder is used to synchronize all the channel out- 
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Fig. 5. The layout of the ground planes lor the NE592 Iront ends is critical lor reducing noise 
and maintaining common mode rejection. 
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puts with a [Simmon system clock. The bus interface section 
contains four control registers for full microprocessor con- 
trollability. A center-frequency generator provides a refer- 
ence when there is an absence of data. 

The standard cell is implemented using HP's CMOS pro- 
cess. The implementation and testing of parts took only 
six engineer-months. Our first silicon version was produc- 
tion quality. Because the chip was flexible and usable the 
first time, we designed it into the HP 7978B as a cost-reduc- 
tion action before the HP 7980A was introduced. The cost 
savings achieved in the HP 797HB was enough to pay for 
the chip's development cost. The gains achieved in the HP 
7980A over the original HP 7978A phase-locked loop in- 
clude a reduction in printed circuit board area by a factor 
of 10, an improvement in reliability by a factor of 8. a 
reduction in factory cost by a factor of 5. and an increase 
in performance by a factor of 2. 

Digitally Controlled Write. Each oi the nine data signals 
from the write formatter is turned into two write currents, 
each driving one half of the write coil for one track. The 
positive half of the write current is controlled by the signals 
GCR, PE, and COMP while the negative half is controlled 
by the signals GCR", PE', and COMP*. These six signals are 
generated for each channel by a PROM which is driven by 
common address signals comp, bp g", and clack. The other 
two address lines, data-1 and data-2. are channel dependent. 
data-2 lags data-1 by one bit window time period, allowing 
the PROM to know the value of the preceding data bit. For 
data equal to one. GCR is asserted for the full bit window 
time. PE is asserted for 100 percent of the bit window time 
in PE mode but for only 16 percent of the bit window time 
in GCR mode. If the preceding bit was a zero, COMP is 
asserted for 80 percent of the bit window time in GCR 
mode or for 90 percent of the bit window time in PE mode. 
For data equal to zero the control signals GCR" and PE" are 
asserted instead of the signals GCR and PE. If the preceding 
bit was a one. COMP - will be asserted. 



Each track has two groups of three control signals. These 
signals control a group of three open-collector drivers. The 
open-collector drivers are connected to one end of a write 
coil through current-limiting resistors. The center tap of 
each write coil is connected to 5V. The half-coil current is 
then the sum of the three currents generated by each open- 
collector driver pulling its resistor to ground. This tech- 
nique allows the write currents to be generated using 5V 
TTL logic. 

To guarantee the write and erase currents will turn on 
only when commanded to, a POWER GOOD signal is hard- 
wired to a protection circuit. The POWER GOOD signal is a 
signal from the power supply which will not be valid until 
voltages are at rated values. The protection circuit turns 
on an enhancement mode n-channel MOSFET which 
supplies live volts to the center taps of the nine write coils. 

The same type of circuit is used in the erase circuit to 
turn on an npn transistor which provides a ground for the 
erase circuit. The dc erase current must ramp to the steady- 
state current in less than 0.2 ms and stay at this value until 
turned off. The erase current is controlled by two circuits. 
The two circuits operate in parallel until the steady-state 
current level is reached. The ramp circuit is then turned 
off. leaving the steady-state current circuit on until the 
erase circuit is turned off. 

Hybrid Servo System 

The HP 7980A servo system loads and unloads the tape 
automatically, controls the tape velocity and tension, and 
performs diagnostic functions. The drive controller uses 
the servo system in conjunction with the read electronics 
and formatter to perform accurate control of tape position. 
The servo moves the motors independently during the tape 
loading process under the control of a 6809 microprocessor. 
Sensors monitor the progress of the tape and check for 
error conditions. Tension is established and the velocity 
and tension servos are enabled. The tape is under closed- 
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loop control thereafter. The tape velocity is primarily con- 
trolled by the take-up motor, and the tension is controlled 
only by the supply motor. A block diagram of the servo 
system is shown in Fig. 7. 

Closed-Loop Operation. Tape velocity is controlled by an 
integral plus proportional control scheme. This yields a 
zero steady-state velocity error. The long-term tape velocity 
error is well under one percent, The controller is im- 
plemented as a digital-analog hybrid to take advantage of 
the best features of each type of control. In the HP 7980A. 
a design requirement for the servo is that the 6809 micro- 
processor be able to manage all the drive control functions 
and also control the velocity with minimal involvement. 
For this reason, the error signal and integral portion of the 
loop are performed by the processor, and the proportional 
feedback and multiplication by coefficients are performed 
by analog circuits. An optical position encoder is used by 
the processor to measure linear tape velocity. This is done 
by counting the number of pulses of the encoder every 5 
ms. The measured velocity is compared with a velocity 
command at each sample period and accumulated over 
time. The resulting value is sent to a 12-bit UAC. The pro- 
cessor has complete control over the velocity of the tape 
by changing the value of the velocity command. This 
scheme performs the integral portion of the control with 
high accuracy. The processor involvement is minimal be- 
cause only simple fixed-point additions and subtractions 
are required. The proportional part of the control is done 
in analog form. The optical encoder drives a tachometer 
circuit which gives a bipolar voltage proportional to tape 
velocity. The tachometer circuit does not have to be very 
accurate since it is only used to generate a proportional 
feedback term. The velocity control accuracy is determined 



by the integral term computed by the processor. The inte- 
gral and proportional terms are summed with appropriate 
scaling factors by an operational amplifier and the resulting 
signal is then sent to the motor amplifier. 

The output voltage of the velocity controller drives the 
take-up and supply motors. This reduces buffer arm deflec- 
tions during velocity ramps. If the velocity and tension 
servos were completely decoupled, the velocity servo 
would start a ramp with the take-up motor, and the ten- 
sion arm would then deflect and start a ramp of the supply 
motor. This would require large arm deflections. In the HP 
7980A the velocity control voltage is scaled and sent to 
both motors. The buffer arm then only needs to deflect a 
small amount to correct the tape velocity at the supply 
motor. 

The variable-velocity rewind is an important feature of 
the HP 7980A tape drive because it allows a reduction of 
rewind time for a 24110-foot reel from 120 seconds for a 
constant-velocity drive to 90 seconds. The maximum re- 
wind velocity attainable by a tape drive primarily depends 
on the maximum voltage available to drive the motors. 
Each motor generates a back emf proportional to its angular 
velocity. This back emf counteracts the supply voltage and 
limits the maximum angular velocity the motor can 
achieve. 

The tape radius of this tape drive can vary over a 2:1 
ratio from beginning of the tape to its end. At each end 
one or the other motor can limit the maximum rewind 
velocity attainable to about 250 ips. In the middle of the 
rewind, the tape radius for both motors is roughly equal, 
and the tape can attain velocities of over 400 ips. In the 
HP 7980A, the processor maximizes the velocity of the 
drive during rewind by reading the voltages sent to the 
motors with an ADC. comparing them against the supply 
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voltage, anil adjusting the servo velocity command several 
times a second to use the available supply voltage most 
effectively. Both servo loops operate al all times; only the 
velocity command is varied during the rewind. 

The tension loop measures the deflection of the buffer 
arm and corrects the velocity of the supply motor until the 
arm returns to its nominal position. This effectively 
matches the linear tape velocities at both reels as the tape 
winds from one reel to the other. If both reels were identical, 
no buffer arm deflection would be required. The buffer arm 
applies a relatively constant force of 10 ounces to the tape 
and thus passively controls the tension. The arm force is 
constant as the arm travels ± 1 inch. The velocity and ten- 
sion loops are approximately decoupled since each motor 
sees a relatively constant load of 10 ounces. 

The tension loop is implemented completely in analog 
form. The buffer arm uses a linear Hall-effect sensor and 
a magnet to convert position to a voltage. This voltage is 
processed by an analog compensator and combined with 
the velocity control voltage to drive the supply motor. The 
key feature of the compensator is that it has an integrating 
term. This always returns the arm to its null point after a 
ramp. It also reduces arm deflections during ramps and 
eliminates final velocity-dependent arm deflections. If the 
integrating term were not present, the high rewind veloci- 
ties of the HP 7980A would not be attainable. 
Open-Loop Operation and Tape Sensors. The autoloading 
feature of the HP 7980A requires individual control of the 
two motors until the tape is loaded. The velocity and ten- 
sion control loops are each followed by a gain/load control 
stage. In the gain mode, this stage can output a voltage 
proportional to an 8-bit number. The processor can perform 
the autoload operations in this mode. When the loops are 
closed, these stages are switched to the gain mode. In this 
mode they operate as programmable gain stages. The pro- 
cessor can adjust the velocity and tension loop gains to 
account for different reel sizes. The gain/load stage is im- 
plemented with an 8-bit multiplying DAC. In the gain 
mode, the voltages generated by the servo loops are fed to 
the reference inputs of the DAC. and the processor adjusts 
the loop gain by writing to the DAC. In the load mode, the 
reference voltage is fixed, and the processor controls the 
motor drive voltage by writing to the DAC. 

Several sensors monitor the progress of the tape during 
the autoload process. A three-line encoder on the supply 
hub has several functions. It determines whether the feet 
on the supply hub have properly engaged the reel, it deter- 
mines the presence of the reel, it is used to control the 
velocity of the supply hub during the tape threading oper- 
ation, and it is used by the processor to determine the reel 
inertia. The three-line encoder is also used with the velocity 
encoder during closed-loop operation to determine the sup- 
ply reel radius instantaneously. 

The beginning-of-tape (BOT) and end-of-tape (EOT) sig- 
nals are determined on half-inch tapes by reflective mar- 
kers. The markers are detected by the tape drive using two 
reflective sensor assemblies. Some tapes are almost as re- 
flective as the markers, and this has been a problem for 
marker detection circuits. The HP 7980A employs a circuit 
that responds to changes in reflectivity, rather than absolute 
levels, to detect the markers. This means that the circuit 



is insensitive to component variations and can handle a 
wide variety of tapes without adjustments. The circuit is 
shown in Fig. 8. The sensor consists of an I.ED and photo- 
transistor pair. The phototransistor receives infrared light 
generated by the LED which bounces off the tape. As a 
marker passes the sensor, the phototransistor emits a cur- 
rent pulse. This current is converted to a voltage VI by 
amplifier Al. V3 is an attenuated version of VI. V2 is a 
filtered version of VI. A2 is an amplifier used as a com- 
parator. A sufficiently large pulse at VI will cause a pulse 
at V„ since V2 does not rise as rapidly as V3. The output 
does not respond to slow variations in tape reflectivity or 
extraneous light sources, but it does respond to the rapid 
change across a reflective marker. 

A tape-in-path sensor determines when tape has entered 
the path. The sensor is a phototransistor and LED pair that 
shines infrared light across the tape entry point near the 
supply reel. The sensor is designed without adjustments 
by using a pulsed technique for detection. The detection 
circuit is similar to the circuit of Fig. 8. In this case the 
LED is pulsed by the processor for a short time with high 
current. If there is tape in the path, no output is detected 
at V„. Otherwise, a fast change is detected at V„. 

An eight-port ADC is used to perform diagnostics on the 
analog circuits. It measures the buffer arm position during 
load and signals the processor to close the servo loops. It 
measures the motor voltages during rewind to adapt the 
rewind velocity. It is also used during self-tests to perform 
a motor-drive loopback, a tachometer loopback, and a 48V 
supply voltage check. 

Tape Positioning Control. Unlike a stop/start drive, stream- 
ing tape drives do not have the ability to stop and then 
restart the tape within the distance of an interblock gap. 
Each time the tape is stopped, a streaming tape drive must 
reverse the tape until it is well ahead of the next record to 
be read. The tape can then be ramped up to speed and the 
read electronics activated upon reaching the interblock gap 
where stopping was initiated. While stopping and starting 
the tape the read head may traverse an unknown number 
of records. Unfortunately, the 6250 GCR and 1600 PE tape 
formats do not support a standard mechanism for identify- 
ing individual records. Therefore, tape repositioning must 
be performed based on physical distance along the tape. 
The HP 7980A Tape Drive uses an optical shaft encoder 
mounted to the capstan wheel in the tape path. This en- 



Tape 



Reflective Marker 




Photosensor 
Assembly 



Fig. 8. Beginning and end-of-tape sensor circuit. 




52 HEWLETT-PACKARD JOURNAL JUNE 1988 

© Copr. 1949-1998 Hewlett-Packard Co. 



coder provides two quadrature pulse trains. A quadrature 
decoder chip (QDC) converts these pulses into a relative 
tape positioning count with the use of an up/down counter. 

Because of the high density and small gap size of the 
GCR recording format, it is important thai relative tape 
positions supplied by the QDC be as accurate as possible. 
When a block-to-gap or gap-to-block boundary is sensed 
by the read circuitry, it is essential that the position ob- 
tained by the QDC at that instant be captured and stored 
away for later use. Although the QDC is a bus-compatible 
device, time delays from the occurrence of a block bound- 
ary to the point at which the microprocessor senses that 
event and reads the position can introduce unacceptable 
errors. To avoid the latencies caused by the additional mi- 
croprocessor cycles, position capture circuitry was de- 
signed around the QDC. This circuitry monitors inputs 
from the read formatter which signal when a block bound- 
ary has been crossed. When the event occurs, external 
hardware extracts the position information from the QDC 
and stores it in the appropriate location of a 4*1 2-bit regis- 
ter file. The position is captured within 2.4 /is of the event. 
This translates into a maximum position error of only 
0.0003 inch. 

Upon stopping the tape, the last valid block-to-gap posi- 
tion is used to mark the gap that follows the last block 
read. The tape is stopped, then restarted in reverse. Moni- 
toring the up/down counter, the drive controller waits until 
the read head is in the target interblock gap. The read elec- 
tronics is then activated and the tape position at the next 
gap-to-block boundary is latched. This position will be the 
same as the block-to-gap position previously latched plus 
any positioning error encountered in stopping and starting 
of the tape. The tape is then brought to a stop and restarted 
again in the forward direction upon receiving a new read 
command. Again, using the up/down counter to monitor 
tape position, the drive controller activates the read elec- 
tronics in the interblock gap before the next record to be 
read. By recapturing the block-to-gap position in reverse 
as a gap-to-block position, the drive controller prevents the 
accumulation of tape positioning error. Each time a posi- 
tion is recaptured, the amount of positioning error is calcu- 
lated by subtracting the two tape position counts. If (his 
error becomes excessive it is reported to the host, thus 
indicating a problem with the drive. 

Vacuum-Fluorescent Front Panel 

We had two basic objectives in designing the HP 7980A's 
front-panel display. The first was to provide an HP-quality 
solution, especially since the display is a focal point for 
the user of the drive. We analyzed a number of factors, 
ranging from the angle of the display to its color and clarity. 
The second objective was to provide a convenient means 
of displaying tape drive status that would be viewable from 
across the length of an average computer room. Because of 
the long-distance viewing requirement, special attention 
was placed upon character size, color, and contrast, and 
upon the physical placement of annunciators. After a thor- 
ough analysis of the options available to us. we converged 
on a design employing a vacuum-fluorescent display. 
Display Operation. In simple terms, the vacuum-fluores- 
cent display (VFD) is a directly heated triode vacuum lube. 



The VFD consists of an electron-emitting cathode filament, 
a screened phosphor anode, and a grid (located between 
the anode and cathode) to control electron flow. Therefore, 
the VFD is structurally similar to old radio or television 
vacuum tubes, yet it performs a totally different function. 
Besides the fact that VFDs are optical displays and not 
amplifiers. VFDs are often constructed with multiple triode 
structures included within the same evacuated tube. This 
feature allows the implementation of cost-effective, complex 
displays. Examples of these inexpensive yet complex VFDs 
can be found in all areas of modern consumer electronics. 

VFD segments are illuminated when electrons emitted 
by the heated cathode (filament) are accelerated by the 
positively biased anode and are allowed by the positively 
biased grid, or mesh, to collide with the screened phosphor. 
This causes photons to be emitted from the phosphor at a 
frequency that depends on the type of phosphor used. This 
feature allows the design of multicolored displays by screen- 
ing different phosphor patterns on the same display. Al- 
though this multicolored feature is attractive, it is not totally 
free. Standard available phosphors emit photons at lumin- 
ous efficiencies that can vary by as much as 50 percent. 
This causes a brightness balance problem between different 
phosphors on the same display. Brightness is typically bal- 
anced using optical filters, different power supplies, or 
different duty cycles. 

The cathode filament, a thin oxide-coated tungsten wire, 
emits thermal electrons when heated to around 600°C by 
a filament power supply. The filament power supply is a 
major part of the power consumed by a VFD. 

The VFD display used in the HP 7980A (Fig. 9) uses an 
ac filament supply providing 2.8Vrms and 0.47W al 715 
Hz. a 30Vdc, 0.6W power supply which provides power 
for the anode and grid drivers, and a 40-pin IC which pro- 
vides level shifting (0-5V to 0-30V) and digital control for 
18 anodes and seven grids. 

Why a Vacuum-Fluorescent Display? There an; five reasons 
for choosing a vacuum-fluorescent display — cost, reliabil- 
ity, versatility, market separation, and human factors. The 
VFD is very cost-effective compared to other options avail- 
able. One other display technology, the liquid-crystal dis- 



r — 


roairwn unload 1600 800 
ISUHUia WRTEN 6250 BB 

® &Ii fWl \W\ IW/I IM7/ \W\ 
//IN| l/hf 0\| l/hl 


^ 1 
' 1 

f 


m 




m 



Fig. 9. Vacuum-fluorescent panel ol HP 7980A 



© Copr. 1949-1998 Hewlett-Packard Co. 



JUNE 1988 HEWLETT-PACKARD JOURNAL 53 



play, was slightly less expensive, but caused concern re- 
garding low-light viewability and electrostatic discharge 
sensitivity during assembly. 

Reliability is probably the VFD's greatest asset. With a 
glass-encased body and an evacuated interior, it is imper- 
vious to electrostatic discharge damage and can endure 
harsh environments. Its low power consumption relative 
to LED displays also allows enhanced reliability. 

Since the VFD's light output is multicolored, it can be 
easily filtered to almost any color desired with standard 
colored lenses. In the HP 7980A Tape Drive, the blue-green 
output is filtered to almost white without significantly af- 
fecting the amber light output. 

Many other tape drives have LED displays and annun- 
ciators. By using a more consumer-oriented display (the 
VFD is standard fare on VCRs), we were able to achieve a 
more contemporary look. 

The contrast of the VFD's bright phosphor with the dark 
filtered background gives excellent viewability from up to 
twenty feet while not being too bright at close range. 
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conveners and communications circuits, using 
bipolar, NMOS, CCD. and GaAs technologies. 
John holds a BSEE degree Irom Ihe Universily of 
Iowa and an MSEE degree Irom Stanford He is a 
member of the IEEE and serves on me program 




56 HEWLETT-PACKARD JOURNAL JUNE 1988 



© Copr. 1949-1998 Hewlett-Packard Co. 




comm-nee ol tne international So'id-State Grouts 
Conference He lives kl Porttna Valley. California 
ana enjoys sfcimg. tennis ana wviemaking 

Ken Poulton 

I Ken Poulton s primary tunc- 
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A One-Gigasample-per-Second Digitizing 
Oscilloscope 

This instrument's high sampling rate makes it particularly 
useful for analyzing high-speed, one-shot occurrences. A 
blend of state-of-the-art designs was required to achieve 
this performance. 

by Joe K. Millard 



OVER THE PAST FOUR YEARS, the demand for 
digitizing oscilloscopes has grown at a rate of about 
30 percent per year. For CRT-based analog oscillo- 
scopes, this rate has dropped to about six percent. The 
growing preference for digitizing oscilloscopes is driven 
by several key advantages of their architecture. First, con- 
tinuous digitization of analog waveforms allows the user 
to view and analyze pretrigger waveform information. This 
feature is essential in diagnosing faults in digital systems 
that lead to a failure from which a trigger is derived. Second, 
total bus control of the digitizing oscilloscope, including 
acquisition and transfer of waveform data, meets the grow- 
ing needs of automatic test and measurement systems. 
Third, waveform record lengths are not time-limited by the 
CRT display, but only by the available high-speed memory 
which can be the equivalent of many CRT screen widths. 

Until now, the primary disadvantage of digitizing oscil- 
loscopes has been the lack of large single-shot bandwidths 
which are essential for digital system diagnostics, high- 
energy physics studies, transient interference analysis, and 
many other applications. The minimum bandwidth re- 
quired for a large number of these applications is in the 



range of 200 to 500 MHz. 

The obvious problem in producing a large single-shot 
(frequently called real-time) bandwidth is the difficulty of 
sampling and analog-to-digital (A-to-D) conversion at rates 
of about four limes the desired bandwidth. Although the 
Nyquist criterion only requires a sample rate of twice the 
signal frequency for accurate reconstruction of sine waves, 
a ratio of about 4:1 (sample rate/bandwidth) is required for 
accurate reconstruction of arbitrary waveforms. Attempts 
to raise the signal band width to values closer to the Nyquist 
frequency result in increasing overshoot and ringing of the 
reconstructed system step response as the Nyquist limit is 
approached. 

The HP 5411 ID Oscilloscope (Fig. 1) was developed to 
address the high-bandwidth single-shot applications indi- 
cated above. The one-gigasample-per-sei "ml s.unpliiii; ami 
A-to-D conversion technology used in the product was de- 
veloped by HP Laboratories (see article on next page). The 
high-speed NMOS memory required for long waveform rec- 
ords at minimal power was developed at Colorado Springs 
Division and processed at the Loveland Technology Center. 
A general-purpose front-end attenuator and preamplifier 
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were also designed at Colorado Springs Division (see article 
on page 67) using custom bipolar lCs and thick-film hybrid 
technologies from the Santa Clara Technology Center and 
the Colorado Springs Technology Center, respectively. De- 
scriptions of several of these developments will be pre- 
sented in the following pages. Digital signal processing 
techniques incorporated for optimizing signal quality will 
also be described (see article on page 70). 



A One-Gigasample-per-Second 
Analog-to-Digital Converter 

by John J. Corcoran, Ken Poulton, and Knud L. Knudsen 



EACH CHANNEL of Hewlett-Packard's Model 541 1 1 D 
Oscilloscope contains a one-gigasample-per-second 
analog-to-digital converter (ADC) which digitizes 
the signal from the preamplifier and writes the result to 
digital memory. The characteristics of this ADC to a large 
extent determine the horizontal and vertical resolution 
of the oscilloscope. The one-gigasample-per-second sample 



rate, for instance, directly determines the achievable 
bandwidth in single-shot operation, an impressive 250 
MHz. 

Four main requirements shaped the architecture of the 
HP 5411 ID ADC: 

■ An 8000-sample memory depth was desired to allow the 
capture of an eight-microsecond time record at the full 
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Repetitive versus 
Single-Shot Bandwidth 



There is often confusion among users of digitizing oscillo- 
scopes about the definitions of repetitive bandwidth and single- 
shot bandwidth, and how these two quite different specifications 
relate to the sampling rate of the ADC used in the instrument 

Smgle-shoi bandwidth is important for those applications 
where the signal occurs infrequently and it is necessary to digi- 
tize the entire event in a single acquisition cycle These applica- 
tions include those for which traditional storage oscilloscopes 
were used In the past, and a host of others made possible by 
the digital properties of the new instruments 

Repetitive bandwidth applies when the signal occurs often 
enough that the waveform can be built up from tens or hundreds 
of successive acquisition cycles to make a composite image on 
the screen These applications include those where a traditional 
(nonstorage) oscilloscope was used in the past The key differ- 
ence is that single-shot bandwidth is proportional to the ADC 
sampling rate, and repetitive bandwidth is quite independent of it 

Single-shot bandwidth is directly related to the ADC sample 
rate by the Nyquist sampling theorem This theorem states that 
to reconstruct a waveform accurately from a limited set of sam- 
ples of that waveform, the sampling rate must be more than twice 
the highest frequency component in the waveform If this condi- 
tion is violated, frequency components of the input that are above 
the Nyquist frequency (half the sampling frequency) will be trans- 
lated by the sampling process to different frequencies below the 
Nyquist Irequency This frequency shifting (also known as alias- 
ing) causes errors that cannot be corrected m the sampled data. 
To avoid these errors, the signal bandwidth must be limited to 
less than half the sampling rate, either by its intrinsic characteris- 
tics or by an anti-aliasing filter in the signal path Then, to recon- 
struct the entire input signal correctly from the limited set of 
samples taken a digital low-pass filter is used to interpolate 
between the actual sample points. Both the anti-aliasing filter 
and this reconstruction filter must have significant attenuation at 
the Nyquist frequency to avoid errors As a practical matter, to 
avoid excessive overshoot in the step response of these filters, 
their composite bandwidth cannot exceed about one-fourth the 
sample frequency Thus the filter requirements reduce the 
achievable single-shot bandwidth from the theoretical one-half 
of the sampling rate to the more practical one-fourth of the sam- 
pling rate 

Repetitive bandwidth, by contrast, is totally independent of the 
A-to-D sampling rate In repetitive sampling, an image of the 
signal is built up by interleaving the samples acquired in multiple 
occurrences of the input signal. The proper placement (on the 
time axis) of each set of samples is guaranteed by a special 
circuit in the instrument that measures the time delay between 
the internal sampling clock and the trigger, which occurs at Ihe 
same point on the signal for each acquisition. The composite 
image thus built up has a very high effective sampling rate, and 
there is no need for either anti-aliasing or reconstruction filters 
The bandwidth in this mode then depends only on analog com- 
ponents like the input attenuator, preamplifier, and sample-and- 
hold circuit, and not on the A-to-D sampling rate 

John J. Corcoran 

Project Manager 
Hewlett-Packard Laboratories 



single-shot bandwidth. 

■ The power consumption of the unalog-to-digital subsys- 
tem was required to be less than 25 watts per channel 
because of instrument power constraints. 

■ The resolution of the ADC had to be consistent with 
accurate waveform reconstruction. 

■ A 1-GHz analog-to-digital conversion bandwidth was de- 
sired so that the overall goal of a 500-MHz instrument 
bandwidth for repetitive signals could be readily 
achieved (see box at left for the distinction between 
single-shot and repetitive bandwidth). 

The requirement for an 8000-sampIe memory system 
made it difficult to use an indirect A-to-D architecture, that 
is. one that requires intermediate analog storage. In an in- 
direct A-to-D system, the samples are captured at the full 
sample rate and stored in an analog memory (for example, 
a charge-coupled device). The samples are then read out 
slowly and digitized by a low-speed ADC. This architecture 
offers the attractive advantages of a lower-speed A-to-D 
converter and digital memory. However, errors that in- 
crease with the storage time of the analog storage device 
and the power and complexity of the associated clock driv- 
ers limit the total practical storage capacity. Hence, the 
decision was made to build a direct ADC. where the input 
signal is digitized at the full sample rate and the data is 
stored directly in high-speed digital memory. The memory 
depth then can be increased without affecting the analog 
accuracy by simply adding digital memory chips. 

The power consumption constraint immediately made 
it clear that custom integrated circuits would be required 
throughout the A-to-D system. A board-level implementa- 
tion, using commercially available components, would 
have required too much power and probably could not 
have met the speed objectives. Next, each IC would need 
to be fabricated in a state-of-the-art technology with excel- 
lent power-delay product, whose performance characteris- 
tics are well-matched to the needs of the individual ICs. 
This led to the use of three different IC technologies: 
b Gallium arsenide (GaAs) is used for the front end of the 

system, where the speed requirements are the greatest. 

■ Silicon bipolar circuitry is used to digitize the signals, 
because of its excellent device matching and high trans- 
conductance. 

■ Silicon NMOS is used for the memory, since it offers an 
excellent combination of speed and device density. 
The silicon bipolar digitizer IC. which uses a modified 
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Fig. 3. A simply interleaved A-to-D system Each digitizer is 
driven from a separate sample-and-hold circuit 
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flash architecture, sets the resolution of the system. In the 
flash architecture there is one comparator for each input 
level to be resolved. Thus the transistor count and power 
are proportional to 2 N . where M is the number of bits of 
resolution. It was decided early that a six-bit digitizer could 
both meet the power budget of the system and provide 
quite acceptable waveform reconstruction, especially when 
combined with some dithering and digital signal process- 
ing features (see the article on poge 70). With these addi- 
tions, the effective resolution of the system is seven or even 
eight bits for slowly varying signals, where the higher res- 
olution is most useful. 

To achieve the 1-GHz A-to-D system bandwidth goal, it 
became quite clear that a sample-and-hold circuit would 
be required. A flash converter intrinsically samples the 
input signal in its clocked comparators, but the usable input 
bandwidth is typically much less than 1 GHz. For (he digi- 
tizer designed for this system, the bandwidth lor accurate 
conversion without an external sample-and-hold circuit is 
about 100 MHz. Thus, it was necessary to add an external 
sample-and-hold circuit, and investigation showed that 
only a GaAs FET IC could supply the needed bandwidth 
and acquisition time. 

Digitizer 

The silicon bipolar integrated circuit implementing the 
digitizing function for the system uses a flash architecture, 
which generally yields the highest A-to-D sample rate pos- 
sible within a given IC technology. However, even with 
HP's oxide-isolated 5-GHz IC process (the fastest bipolar 
process available at the time), a digitizer could not be built 
that could be clocked at the required one-gigasample/sec- 
ond rate. An even more severe constraint was the require- 
ment to drive and settle the input capacitance of the digi- 
tizer with the step-like input signals supplied by the sam- 
ple-and-hold circuit. Simulations showed that the best 
achievable settling time, with margins appropriate for a 
production instrument, was about 4 ns. Thus it was decided 
to build a digitizer that could be clocked at 250 megasam- 
ples/second (a 4-ns period), and interleave four of these to 
achieve the required net sample rate of 1 GHz. 

A block diagram of a traditional flash ADC is shown in 
Fig. 1, For six-bit resolution. 64 clocked comparators are 
required. The reference voltage V REF , equal to the full-scale 
input of the converter, is divided by the resistor network 
and distributed to the comparators. When the comparators 
are clocked, their outputs go to a logical one or zero state, 
depending on the relative value of the comparator reference 
and input signal at the time the clock occurred, resulting 
in a "thermometer code" with all zeros above the com- 
parator nearest the input voltage, and all ones below that 
comparator. The thermometer code contains a single point 
where an adjacent one and zero occur, and this is detected 
by one of the exclusive-OR gates. The high exclusive-OR 
output then drives the encoder to produce the correct bi- 
nary value, which is the output of the ADC. 

Although a flash converter is very fast, it has several 
disadvantages. The device count and power are both high, 
because of the fully parallel nature of the architecture. For 
the same reason, the input and clock capacitances are high, 
and these lead to practical problems in driving the con- 



verter from other circuits. To minimize these problems, we 
use a circuit that we call an analog encoder 1 3 to reduce 
the number of clocked comparators required. 

A block diagram of an analog encoder is shown in Fig. 
2. It consists of four differential pairs, connected with col- 
lectors alternately crossed as shown. The input signal 
drives one side and the reference voltages drive the other 
side of the differential pairs. The resulting transfer function 
of this circuit, also shown in Fig. 2. changes between a 
high and a low level four times. These changes occur when 
the input voltage is equal to one of the reference voltages 
applied to the differential pairs. This circuit essentially 
implements the functionality of four comparators and four 
exclusive-OR circuits in the traditional flash converter, ex- 
cept that the output is analog. Therefore, a single clocked 
comparator is placed after the analog encoder. The net 
result is that four differential pairs and one clocked com- 
parator replace four clocked comparators and four exclu- 
sive-OR gates. This is a net gain of about a factor of four in 
transistor count and a factor of at least two in power. Fur- 
thermore, since the number of clocked comparators is about 
four times lower, the clock input capacitance is similarly 
reduced. 

The six-bit digitizer is implemented using seventeen of 
these analog encoders and seventeen clocked comparators. 
To produce the least-significant bits of the output data 
(where more than four transitions must be encoded), the 
outputs of several encoder-comparator groups, with ap- 
propriately interleaved thresholds, are logically combined. 

The digital encoder produces a Gray-coded output. Gray 
code is a method of coding 2 N levels in N bits. Its main 
characteristic is that a transition from any level to a neigh- 
boring level causes only one output bit to change. The more 
common binary coding has points where going from one 
level to the next can change all output bits simultaneously. 
If a comparator in a binary digitizer should be undecided 
because its reference voltage is equal to the input, then the 
output can produce large errors if it is interpreted differ- 
ently by the encoding paths for the different output bits 
affected. But in a Gray-coded digitizer, each comparator 
affects only one output bit, and hence, if a comparator 
should be undecided, the output can only be interpreted 
to be one of two adjacent codes, which can be considered 
to be equally correct measures of the input voltage. 

Although analog encoded architecture reduces the de- 
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Fig. 4. E Itect ol uneven sample timing in an interleaved sys- 
tem 
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vice count, power, and clock capacitance, it does nothing 
to reduce the high capacitance at the analog input, which 
is typical of flash converters. Simulations showed it would 
be impossible to drive and settle the expected input capaci- 
tance through the inductance of the bond wire used to 
connect the digitizer's input to its package. To eliminate 
this problem, an on-chip input buffer, optimized for step 
settling, was added to ihe digitizer. This buffer reduces the 
effective input capacitance from 25 pF to 4 pF and allows 
settling in the required 4 ns. 

The resulting digitizer chip* is quite small, measuring 
only 2.2x3.5 mm. This chip is a complete ADC in its own 
right; it can be clocked at rates up to 400 MHz, and it can 
be used without a sample-and-hold circuit on signal hand- 
widths of up to 100 MHz. It is currently used in the HP 
54201 and HP 54112 Digital Oscilloscopes and in an op- 
tional board for the HP 16500A Logic Analyzer. Power 
dissipation of the digitizer is 2.5 watts, so the four chips 
required for the 1-GHz A-to-D system consume a total of 
10 watts. 

Interleaving Digitizers 

To meet the required sampling rate of 1 GHz, four of the 
silicon digitizers are interleaved, each operating at 250 
MHz. A simply interleaved A-to-D system is shown in Fig. 
3, where each digitizer has its own sample-and-hold circuit. 
The problem with this arrangement is that the four samplers 
sample the input signal separately, so any errors in their 
relative timing will be reflected as inaccurate sampling. As 
shown in Fig. 4. a deviation At in the desired timing of 
sample 3 results in an error AV in the voltage displayed 



for that sample. For high-slew-rate signals, such as a full- 
scale, 500-MHz input signal, maintaining six-bit accuracy 
requires that the relative timing be accurate to 5 ps. With 
the IC and packaging processes available, timing errors of 
around 50 ps were expected. 

To avoid the 5-ps timing accuracy requirement, a fifth 
sample-and-hold circuit, operating at 1 GHz, was placed 
in front of the four sample-and-hold circuits operating at 
250 MHz (Fig. 5). This first-rank sample-and-hold circuit 
samples for 500 ps, acquiring Ihe input signal on its hold 
capacitor. Then it goes into hold for 500 ps, which creates 
a flat spot on an otherwise quickly moving waveform. The 
four second-rank sample-and-hold circuits each sample for 
1 ns and hold for 3 ns. They will each finish their acquisi- 
tion time during one of Ihe 500-ps flat spots. Since they 
finish their acquisition on a signal that is no longer moving, 
the timing of their clock signals is no longer critical. Pro- 
ducing a sufficiently stable clock for the single first-rank 
sample-and-hold circuit is relatively easy. 

Sample-and-Hold Circuits 

The heart of each sample-and-hold circuit is the diode 
bridge sampling gate. 3 A simplified schematic is shown in 
Fig. 6. In the sample mode. Icontrol flows down through 
the four diodes, D1-D4, forming the bridge. If the voltage 
V| N is the same as V 0 |, T . Ihe current is split evenly between 
diodes Dl and D3 and no charging of the 0.3-pF hold 
capacitor takes place. However, if V, N is higher than V OIIT . 
the voltages across Dl and D4 will be smaller than the 
voltages across D3 and D2. Hence, the current in D3 will 
be larger than the current in D4, which causes a net current 
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250 Ms s 




Output I 
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Output 3 



Output 4 

Fig. 5. Sample-and-hold chip 
block diagram. All buffers are unity- 
gain source followers. The bridges 



ship 24 are diode bridge sampling gates. 
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flowing into the hold capacitor until V, N and V oirT are the 
same. 

The bridge is turned off (put into hold mode) by reversing 
the current Icon-trim ■ 'n this case the current flows through 
diodes D5 and D6. which places a reverse bias on diodes 
D1-D4. breaking the connection between V (N and V OUT . 

A diode bridge sampling gate has several key advantages 
over simpler circuits such as pass-FET circuits: 

■ The circuit turns on and off very quickly. 

■ The diode voltages need only decrease by approximately 
200 mV (8V T ) to turn off the bridge effectively. 

■ The circuit is intrinsically balanced, so there is (to first 
order) no clock feedthrough onto V OUT . 

■ The hold isolation is quite good. Diodes Dl, D5. and D3 
effectively form a series-shunt-series isolation circuit in 
the off state, as do diodes D2. D6. and D4. 

A distinction is sometimes made between a sample-and- 
hold and a track-and-hold approach. If the charging of the 
hold capacitor is allowed to occur until V, N and V OUT 
equalize, the circuit is often called a track-and-hold circuit. 
If the on interval is shorter than the settling time, the circuit 
is called a sample-and-hold circuit. Using GaAs ICs allowed 
us to design our circuit as a track-and-hold circuit — the 
voltage on the hold capacitor settles completely to V, N dur- 
ing the 500-ps on time. The track-and-hold circuit has a 
higher gain than a sample-and-hold circuit and it is insen- 
sitive to variations in the pulse width of the sampling clock. 
In the rest of this article, we shall use the terms track-and- 
hold and sample-and-hold interchangeably. 

HP's GaAs IC process uses \-fim gate lengths to build 
depletion MESFETs (metal-semiconductor FETs) with 
a pinchoff voltage V p of -2V and a unity-current-gain fre- 
quency tf of 14 GHz. Since a MESFET contains a Schottky 
diode junction, a fast diode (RonQjff = 1 P s ) tan ' ,e made 
from a MESFET by connecting its drain and source together 
as one electrode and using the MESFET's gate as the other. 
This process also includes integrated resistors and 
capacitors. A GaAs FET process is particularly well-suited 
to building sample-and-hold circuits because: 

■ There are fast Schottky diodes for building diode bridge 




0.3 pF 



circuits. 

■ The FET speed allows construction of fast switching 
circuits for control of the bridges and very fast source 
follower circuits for buffering the inputs and outputs of 
the hridges. 

■ The semi-insulating GaAs substrate reduces parasitic 
capacitances. 

This process is described in more detail in reference 4. 

Thu five sample-and-hold circuits shown in Fig. 5 are 
integrated on one GaAs chip. A companion clock genera- 
tion chip, also built in GaAs. accepts the input clock (at 1 
GHz or below), amplifies it lo GaAs levels (3V swings), and 
provides the clocks for the five sample-and-hold circuits 
and the four digitizers. Various characteristics of the chips 
used in this ADC are given in Table I. 



Table I 



Chip: 



Sampler 



Control 



Digitizer 



Technology: GaAs D-MESFET GaAs D-MESFET Si Bipolar 
1-pmGate 1-AimGate 5-GHzf, 

Chip Size: 1.4x1.7 mm 

Device Count: 460 

Number of Pads: 40 



1.1x1.7 mm 2.2 - .'1.5 mm 
390 920 
35 52 



Power: 



2.9W 



2.5W 



2.5VV 



Increasing Resolution 

The major problem with an interleaved ADC is the need 
to provide good matching between the four digitizers, both 
in time and in voltage. The need for matching in time is 
addressed by the use of two ranks of sample-and-hold cir- 
cuits. The need for matching in voltage is addressed by 
controlling the voltage references used by the four digitizers 
with individual digital-to-analog converters that are set by 
the system processor during a self-calibration cycle. 

However, having multiple digitizers can bean advantage. 
The four 6-bit digitizers' reference voltages can be offset 
from each other in steps of 1/4 LSB (least-significant bit), 
that is. in steps of l/256th of the input voltage range. This 
causes the four digitizers to be interleaved in voltage, as 
well as in time. This allows 256 levels to be resolved for 
slowly changing input signals, so the resolution for low-fre- 
quency input signals can be eight bits rather than six bits. 

To display a resolution of 256 levels, the data from all 
four digitizers must be combined. This can be done with 
a nonlinear smoothing filter, which moves each sample 
value towards the average of four adjacent samples, but 
never moves a sample by more than half a six-bit LSB (see 
' Dithering in the HP 5411 ID," page 72). Although this 
mechanism can give true 8-bit resolution on sine waves 
only if the signal frequency is below 1 MHZ, it preserves 
the instrument signal bandwidth and provides a significant 
enhancement in resolution on the flat portions of pulse 
waveforms of any speed. 



Fig. 6. Simplified schematic ol the diode bridge sampling 
gate . I control reverses direction to turn the sampling gate oil 



System Performance 

The frequency response of the ADC, sampling at 1 Cllz, 

(connnuod on page 65) 
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Digitizer Hybrid 



The Ihree-inch-square digitizer hybrid circuit (see Fig 1 ) in the 
HP 541 1 1 D Digitizing Oscilloscope contains the GaAs track-and- 
hold chip, the GaAs clock and mode control chip, and the four 
six-bit. Gray code, bipolar analog-to-digital flash converters that 
operate at 250 million conversions per second Two main con- 
tributions that the thick-film technology makes to the circuit are 
high-frequency performance and high power dissipation 




Fig. 1. The one-gigasample-per-second digitizer hybrid cir- 
cuit used m the HP 541 11 D Digitizing Oscilloscope The GaAs 
sample-and-hold and clock generation chips are in the center 
The tour 250-MHz bipolar digitizer chips are located in each 
corner. 

The high-frequency performance is made possible by mount- 
ing the IC chips directly onto the thick-film substrate and then 
wire-bonding the chip-to-substrate connections This eliminates 
the need for an intermediate level of packaging, which would 




Fig. 2. Close-up view ol the two GaAs IC chips in the center 
ot the digitizer hybrid circuit ot Fig 1 

add delay and parasitics to the circuit Fig 2 shows a close-up 
view of the two GaAs chips mounted in the center of the substrate. 

The track-and-hold and clock controller chips are wire-bonded 
directly together (chip-to-chip) This keeps the lead lengths be- 
tween them as short as possible to minimize delay, the parasitic 
inductance of each lead, and the skew between the lines 

The circuit uses 50 and 75-ohm transmission lines for the 
analog input signals and clock lines, respectively Thick-tilm tech- 
nology provides well-controlled impedances for transmission 
lines It also makes possible the fabrication of very small resistors 
to minimize parasitics. 

The second contribution that thick-film technology makes to 
the circuit is in dissipating power — lots of power. The hybrid 
circuit dissipates 17 watts. The two GaAs chips, dissipating 6W, 
are attached directly to a heat spreader that fits through a hole 
in the center of the substrate (refer to Fig 2) This heat spreader 
has a thermal coefficient of expansion that nearly matches that 
of the 96% alumina substrate and has a thermal conduction value 
between copper and tungsten. Attaching the chips directly to 
the heat spreader results in a decrease in junction temperature 




Fig. 3. Data acquisition board tor 
the HP 541 11 D. The digitizer hy- 
brid circuit is located beneath the 
shield in the center 
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of nearty 20*C compared to mounting the chips on the alumina 
The chips are mounted to the heat spreader using conductive 
epo*y Although the epoxy process has a higher thermal resis- 
tance than a soloer process it is less costly and simpler 

The 148 interconnections between the hybrid circuit ana the 
pnntea circuit board are made using flex circuits Rex circuits 
consist of flat beryllium copper conductors etched on a poiyimide 
film The tour flex circuit strips are vapor phase soldered around 
the perimeter of the substrate, and the other ends of the strips 
are soldered to the pnnted circuit board The flex circuits provide 
excellent high-treauency performance and allow stress relief be- 
tween the hybrid circuit and the printed circuit board on which 
the hybrid is mounted 

A low-profile large-area heat sink mounts on the back of and 
extends through the printed circuit board The heat sink contacts 



the bottom of the hybnd circuit at the heat spreader and directly 
beneath each of the four ar>.aiog-to-digitai converters each of 
which dissipates nearly 3W Five ceramic lids protect the hybnd 
orcuit's wire-bonded IC chips A shield covers the hytxid circuit 
ana presses down on these lids ensunng that good contact 
exists between the heat sink and the back of the hybnd circuit 
Rg 3 shows a picture ol the data acauiSHion board The digitizer 
hybrid circuit is unaerneath the metal shield m the center and 
the ilex circuit leads are iust visible around the perimeter of the 
shield 

Lewis R Dove 
Mark E. Mathews 

Development Engineers 
Colorado Integrated Circuits Division 



(continued Horn page 63l 

is shown in Fig. 7. The 3-dB bandwidth is 1.7 GHz and 
the distortion is below -40 dBc (173 L.SB for 6-bit resolu- 
tion) for input signals up to 500 MHz. (The performance 
measurements quoted here are for the A-to-D system alone, 
and do not include the HP 541 1 1 D's input attenuator and 
preamplifier circuits.) Any timing misalignments between 
the four digitizers would show up as frequency components 
Isampling products) around 250 MHz: these components 
are suppressed to below -40 dBc by the use of the two-rank 
sample-and-hold architecture. 

Effective bits of resolution is a measure of analog-to-dig- 
ital conversion accuracy; s it measures an ADC's perfor- 
mance in digitizing a sine wave, compared against a 
hypothetical ideal ADC of that resolution. Any noise, non- 
linearity, or missing codes will cause an ADC's effective 
bits of resolution to fall below its nominal resolution. Thus, 
a perfect, noiseless. 6-bit ADC will show 6,0 effective bits 
of resolution: a poor one might show 3.5 effective bits. 

The resolution, in effective bits, achieved by this ADC 
while sampling at 1 GHz is shown in Fig. 8. Here. 5.2 
effective bits are measured all the way up to a 1-GHz input 
frequency. The limiting factor in this case is the mismatch 
between the four digitizers, since a singledigitizerachieves 
5.9 effective bits of resolution at 250 megasamples/second. 

io -r 



At lower input frequencies, the number of effective bits 
rises to nearly 7 as the higher resolution afforded by the 
nonlinear smoothing filter comes into effect. At low fre- 
quencies, the number of effective bits is limited mainly by 
the nonlinearities of the digitizers. 

Fig. 9 shows the result of a beat frequency test using the 
voltage interleaving and nonlinear filtering described 
above. The larger sine wave is a 250-kHz sinusoidal signal 
sampled at 1 GHz. The smaller sine wave is the result of 
sampling a 1.000250-GHz sinusoidal signal. Even at a 1- 
GHz input frequency, the digitized waveform is clean with 
no visible aberrations. 

Packaging 

The A-to-D system (that is. the GaAs sample-and-hold 
chip, the GaAs clock generation chip, and the four bipolar 
digitizer chips) is packaged on a single thick-film hybrid 
substrate (see box on page 64). The two GaAs chips are 
side by side in the center and surrounded by monoblock 
bypass capacitors and thick-film termination resistors. The 
four digitizer chips are placed near the four corners of the 
hybrid, along with their associated capacitors and resistors. 
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Front-End Signal Conditioning for a 
High-Speed Digitizing Oscilloscope 

by Thomas K. Bohley, Mark E. Mathews. Lewis R. Dove, Joe K. Millard. David W. Bigelow. and Donald 
D. Skarke 



CRITICAL TO THE ACHIEVEMENT OF high-perfor- 
mance in any digitizing oscilloscope is the design 
of the front-end signal conditioning stages. The de- 
sign of these stages in the HP 5411 ID Oscilloscope was 
particularly difficult because of the HP 5411lD's 500-MHz 
required signal bandwidth. Key elements in the design are 
the attenuator, the switch assembly, the hybrid circuit upon 
which the attenuator is fabricated, and the preamplifier. 

Attenuator 

The theory of operation of a compensated one-megohm 
attenuator, shown in Fig. %, can be found in any text on 
linear waveshaping. Why then have they not been used in 
500-MHz oscilloscopes before? The answer can be found 
in Fig. 2; not in the divide-hy-10 attenuator sections that 
can be cascaded, but rather in the straight-through divide- 
by-1 path. It is in that path that the inevitable series para- 
sitic inductance is at a maximum, and it is in that path 
that the series inductance and the input capacitance of the 
following preamplifier form a passive resonant circuit that 
Introduces ringing on input pulses and limits the bandwidth. 

An effective solution to this problem is to raise the reson- 
ant frequency high enough so that the oscilloscope's ampli- 
fier no longer responds to that resonant frequency. Gener- 
ally, attempting to damp the resonant circuit by introducing 
a series resistance in the inductive path results in creating 
an RC corner in the passband of the following amplifier. 

To raise the resonant frequency we must make both the 
series inductance and shunt capacitance as small as possi- 
ble. The shunt capacitance, determined mostly by the input 
FET. is already small since a high-frequency FET has been 
chosen. This leaves the series inductance to be minimized 
and the most effective way of doing that is simply to limit 
the physical length of the straight-through path. 

The attenuator is constructed on a thick-film 0.5 xi, fl- 
inch substrate with the input and output separated by about 
0.5 inch. To build an attenuator of this size would not be 
possible without one more technique — moving the ac cou- 
pling capacitor from the attenuator itself, where the capaci- 



C1 



In 



PH. 

4 1 W\ • f- 

h 



Out 



tSS 



R1C1 a R2C2 

Fig. 1. Compensated attenuator circuit. 




Output 
to 

Preamplifier 



Fig. 2. Input attenuator with ^10 sections and straight- 
through *i path 

tor also contributes series inductance, to the preamplifier 
where it is placed in a low-frequency path. The divide-by-10 
sections require considerable care to ensure that they, be- 
cause of their own parasitic inductance (Fig. 3). also do 
not form resonant circuits. In this attenuator, resistors R3 
and R4 are sized to damp the parasitic inductance associated 
with the capacitors and form a resistive divide-by-10 attenu- 
ator for signals well beyond the amplifier passband. Resis- 
tors Rl and R2 r.arry only relatively low-speed signals and 
are less important to the high-frequency performance of 
the attenuator than capacitors CI and C2. The divide-by-10 
sections have greater bandwidth than the straight-through 
path and must be intentionally slowed down to keep the 
response as seen on screen consistent from range to range. 

Switch Assembly 

The challenges in designing the switch assembly were 
to provide small size, repeatable characteristics, reliability, 
long life, and remote switching. The small size required 
that contact movement be minimal and repeatable. Other- 
wise, contact stress would be excessive and electrical 
characteristics would vary. Minimal contact movement 
also complicated the remote control requirement. The solu- 
tion to these problems was accomplished by the design 
shown in Fig. 4. Contact lift is controlled by the shuttle 
width, contact pin location, and dimensions of the formed 
contact. These parameters are controlled by the tooling 
used in their manufacture and thus are repeatable. Shuttle 
travel is not critical and is easily within the bounds of the 
solenoids and associated switching apparatus. The contact 
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Fig. 3. Resistors R3 and R4 added to circuit ol Fig 1 to damp 
parasitic inductances associated with capacitors C 1 and C2 
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pins are held and positioned by the pin carrier and connect 
directly to the attenuator substrate to keep the electrical 
path .short. The switch contacts and contact pins are made 
of high-gold-content alloys for reliability and to reduce 
vulnerability to plating variations and wear. 

Attenuator Hybrid Circuit 

The attenuator hybrid circuit is shown in Fig. 5. This 
hybrid circuit has a switchable input impedance of one 
megohm or 50 ohms. There are three decades of attenuation 
at one megohm with 1:1, 1:10. and 1:100 divider ratios. 
Switch contacts are made as described in Fig. 4 using small 
pins that extend through the hybrid substrate. The elec- 
tromechanical actuators move a shuttle to open or close 
the appropriate contact. 

The attenuator hybrid circuit's bandwidth exceeds 500 
MHz. The high bandwidth is achieved by implementing 
the circuit elements in thick-film patterns and by 
miniaturizing the contact assembly, both of which mini- 
mize parasitic inductance and capacitance. 

The contact pins are attached to the ceramic substrate 
with conductive epoxy. A print process using a stencil was 
developed to apply epoxy automatically to the fifteen pins 
and into the pins' holes. To keep the epoxy from running 
completely through the holes to the backside of the sub- 
strate, positive air pressure is maintained on the backside. 
The pressure is turned off after the print squeegee passes 
over the holes so that it does not blow the epoxy back out 
of the holes. 

Most of the thick-film resistors are passively trimmed to 
a predetermined resistance value. However, one resistor is 
actively trimmed to set a circuit parameter. The resistor is 
used in the divider network of a circuit that senses danger- 
ous input overvoltages which could harm the 50-ohm. 1% 
input resistor. The divider resistor is actively trimmed by 
applying 10 volts to the input and trimming until 100 mV 
is obtained at the divider's output. Active trimming is typ- 
ically much less expensive than the alternative of manually 



Fig. 4. Attenuator switch assem- 
bly lor HP 541 1 1 D 

adjusting a variable component. 

Preamplifier Hybrid Circuit 

The preamplifier hybrid circuit shown in Fig. 6 is mounted 
in the attenuator/preamp assembly. This amplifier boosts 
the signal from the attenuator to the amplitude required 
by the GaAs track-and-hold stage. The hybrid circuit has 
one-megohm input impedance, a bandwidth of 750 MHz, 
and an overall gain of 36 with a 50-ohm output load. It 
also provides trigger pickoff and conditioning. The 1.5 x 
3.0-inch hybrid circuit dissipates 5 watts. This hybrid con- 
tains a custom input MOSFET, two custom bipolar ICs. 
and two pnp and one npn discrete microwave transistor 
chips. 

The custom MOSFET designed at the Loveland Technol- 
ogy Center includes a limited overload protection circuit. 
The first custom bipolar chip is a preamplifier that has a 




Fig. 5. Top (left) and bottom (right) views ol attenuator hybrid 
circuit 
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diagram 




Fig. 7. Preamplifier hybrid circuit 
(right) and attenuator (left) mounted 
in the preampiattenuator assembly 



programmable 1,2,4 attenuator and a continuous vernier. 
This allows a user to adjust the input signal amplitude to 
full screen for maximum resolution. In addition, this chip 
generates a logic trigger signal used to reduce trigger jitter 
caused by noise contamination of the analog sync used in 
most instruments. Finally, an auxiliary input on the 
preamplifier chip can be switched on to calibrate the entire 
vertical deflection path from the input of the preamplifier 
chip through the analog-to-digital converters (ADCs). thus 
ensuring excellent gain and offset stability. 

The second custom bipolar chip is a driver chip which 
provides the final gain needed to drive the GaAs track-and- 
hold circuits. Included on the chip is a selectable x5 
amplifier. 

As a final part of the signal conditioning the differential 
signal from the ADC driver chip is level-shifted to near 
ground and converted to a single-ended signal. The circuit 
used is a pnp level-shift circuit. The final output voltage 



of the preamplifier hybrid is ±0.64 volt with a 50-ohm 
load. 

The preamplifier hybrid circuit is connected to the at- 
tenuator hybrid circuit through a compressed leaf spring 
that contacts gold pads on each of the hybrid circuits. Fig. 
7 shows the placement of the preamplifier next to the at- 
tenuator in the assembly (the attenuator is to the left of the 
preamplifier). The trigger output from the preamplifier IC 
drives 50-ohm differential transmission lines on the thick- 
film substrate. These lines lead to two SMC connectors 
epoxied to the back of the substrate. The connector pins 
protrude through holes in the substrate so that they can be 
soldered to the transmission lines on top. The linear signal 
output is connected to a third backside SMC connector 
through a 50-ohm microstrip. All of the bias and ground 
connections to the hybrid circuit are made via two dual 
in-line connectors. These are soldered through holes in the 
substrate. 
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Digital Filtering in a High-Speed Digitizing 
Oscilloscope 

by B. Allen Montijo 



IN THE LAST FEW YEARS the digitizing oscilloscope 
has emerged as a useful tool for waveform acquisition 
and characterization of repetitive events. As a real-time 
(single-shot) tool, misconceptions about its abilities and 
limitations are restricting its use. Confusion over "effective 
bits" and "interpolation" causes the user to question the 
validity of data and measurements. To add to the confusion, 
digital signal processing techniques support concepts such 
as trading excess bandwidth for extra resolution. 

Two events occur when a waveform is digitized; the 
signal is sampled and it is converted to a digital format. 
Sampling should be thought of as an analog process. This 
procedure transforms the signal from the s-plane to the 
z-plane and can create aliasing. Once aliasing occurs, no 
operation can recover the original signal without some 
knowledge about the signal. After the signal is sampled, 
an analog-to-digital converter (ADC) generates a digital 
code that represents the analog sample. This process is 
simply a format conversion, similar to converting a real 
number to an integer on a computer. The conversion adds 
quantization noise to the signal which is dependent upon 
the quality of the ADC and the number of bits the ADC 
uses to represent the sample. The digitized signal can be 
thought of as the sampled signal with noise added. This 
article deals with the signal processing aspects of digitizing 
and assumes that aliasing is not a concern. With this as- 
sumption, the digitized signal contains all of the informa- 
tion necessary to reconstruct the original continuous signal, 
with some additional noise. Then the digital samples can 
be interpreted as the original analog signal with quantiza- 
tion noise added. This concept forms the basis for believing 
the information displayed by a digitizing instrument. So 
long as you know what the signal processing is doing to 
your signal and that aliasing is not a problem, you can 
believe what is on your screen. 

Effective Bits and Noise 

Effective fails has become a popular term for describing 
the performance of digitizing instruments. The term com- 
pares the noise characteristics of the instrument with those 
of an ideal ADC. To find an expression for effective bits, 
begin with the equation for the mean squared error of an 
ideal ADC. The error can range from -1/2 to +1/2 of a 
quantization level (OJ with equal probability. This error is 
squared, integrated, and divided by the interval: 

XQ/2 
x 2 dx = CT/12 (1) 
3/2 



If the ADC covers a voltage range of A and has N bits, then 



Q = A/2 N (2) 

so that 

s 2 dl: = A 2 /(12 x 4 N ) (3) 
The equation can be solved for N to give: 

N = log 4 (A 2 /s 2 dc ) - 1.7925 (4) 
For a full-scale sine wave input, the rms voltage is 

B = A/2V2. (5) 

Then 

N = l 0g4 (B 2 /s 2 d , : ) - 0.2925 (fi) 

To find effective bits for a system, the noise term s 2 rfl . is 
replaced with the total system noise s 2 ystem = s 2 nalog + s 2 dc : 

N„ fl = log, (B 2 /s 2 yslBm ) - 0.2925 (7) 

The variable term (B 2 /s 2 y , lD ,„) is simply a signal-to-noise 
ratio (SNR). Effective bits is an expression of SNR; a factor 
of 4 in SNR (a factor of 2 in voltage) is one effective bit. 
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The SNR for an analog system can be plugged into equation 
7 for comparison with a digitizing system. 

For a digitizing system, the SNR is often found using the 
sine-wave curve-fit test. A spectrally pure sine wave is 
digitized, then the best-fit sine wave is calculated for the 
data. The best-fit routine finds the ideal sine wave that 
minimizes the rms error between the data and the ideal 
sine wave. This procedure cancels any systematic errors 
in offset, gain, frequency (sampling and signal), and phase. 
The residual rms error is substituted into the equation as 
the system noise voltage s iyslem to obtain effective bits at 
the signal frequency. The test is repeated at various signal 
frequencies to obtain a curve of effective bits versus fre- 
quency. 

The test is not as straightforward as it sounds. Signal 
amplitude is an important parameter because the slew rate 
of the signal and the amount of distortion depend upon 
the amplitude. High slew rates put stress on the analog 
components and the ADC. and magnify the aperture jitter. 
Distortion is important because the harmonics generated 
are considered noise. For most statistical operations, more 
data is better. In this case, more data places stricter require- 
ments on the phase noise of the sampler and the test signal. 
Since the ideal sine wave is spectrally pure, any relative 
phase noise between the sampling clock and the signal 
decreases the number of effective bits. Finally, the purity 
of the test signal must be better than the unit under test, 
preferably by a factor of ten. This often requires the use of 
filters in the signal path. 

Care must also be taken in gathering data. The signal 
frequency should be selected so that each output code ap- 
pears several times in the data; it is not a good test to input 
exactly the Nyquist frequency since only two output codes 
(ideally) will be produced by the ADC. 

The system noise being measured consists of two compo- 
nents: analog noise and quantization noise. Digital process- 
ing also adds a noise component, but this will be ignored 
since it is insignificant if the processing is done with enough 
bits. Analog component blocks, such as signal conditioning 
and sample-and-hold circuitry, create the analog noise 
while the ADC creates the quantization noise. An often 
neglected source of analog noise is the user's signal. With- 
out special processing the best SNR that can be hoped for 
is the SNR at the instrument input. Fig. 1 demonstrates 
how the number of effective bits is determined by both 
sources of noise. The x axis represents the rms noise at the 
input of the ADC relative to the ADC's voltage range (also 
expressed in dB). Peak-to-peak noise is the rms noise multi- 
plied by five. Noise is shown increasing to the right. The 
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y axis displays the effective bits for ideal ADCs of four to 
eight bits. On the horizontal portions of the curves, system 
performance is limited by the ADC. while on the slanted 
sections, the limitation is analog noise. All the curves over- 
lap when analog noise is dominant. This indicates that 
extra bits of resolution are of no value if they are only 
measuring noise. 

The overall signal-to-noise ratio, rather than the number 
of bits, is the important parameter. The effective resolution 
and measurement accuracy of a system depend on this 
value. More ADC bits may give a prettier display, but may 
be meaningless as well as costly. 

Improving SNR by Filtering 

The seven and eight-bit filters in the HP 54111D Oscillo- 
scope are low-pass filters that allow the user to trade off 
excess bandwidth for improved noise performance. Al- 
though this is often done in analog systems, digitization 
adds complications. 

How do spectrum analyzers achieve a high signal-to- 
noise ratio? By limiting the bandwidth during a measure- 
ment, most of the noise is filtered out. allowing the detec- 
tion of small signals. For a wideband system, such as an 
oscilloscope, the major analog noise contributor can be 
assumed to be white in nature, that is. every frequency 
band of a given width contains the same noise power. This 
is reflected in the typical specification for noise, which is 
expressed as volts per square-root hertz, or in terms of 
power as watts per hertz. The noise is present at all frequen- 
cies within the bandwidth of the system, whether or not a 
signal actually exists. For a real-time sampled system, the 
maximum bandwidth is the Nyquist frequency f„. If the 
full system bandwidth is not needed for a signal, then the 
noise can be reduced by limiting the bandwidth, thereby 
increasing the SNR. If the noise is white, there is a direct 
correlation between bandwidth reduction and noise reduc- 
tion. SNR improvement by a factor of 4 gives one effective 
bit improvement; that is, filtering out 75 percent of the 
noise improves the performance by one effective bit. 

The linear model for the system is shown in Fig. 2. Analog 
noise is added to the signal by the preamplifier, then quan- 
tization noise is added by the ADC. The quantization noise 
may be a function of the input, but the model is still linear. 
Superposition can be used to create the model shown in 
Fig. 3. Each component of the digitized signal can be 
analyzed easily. Assume that the filter is a perfect low-pass 
filter at one fourth of the Nyquist frequency. If the input 
is within the bandwidth of the filter, it passes through the 
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ot noise. 



Fig. 3. Superposition applied to the serial model ol Fig 2 
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Dithering in the HP 5411 ID 



In general, dithering is the addition of a small signal, possibly 
noise, to a system to improve measurement accuracy. For exam- 
ple, many DVMs average a number ol samples to give a more 
accurate measurement Random noise helps the averaging pro- 
cess work, but a controlled signal provides better accuracy with 
fewer samples. 

Most ADCs have reference inputs that allow the user to deter- 
mine the voltage range for conversions. All of the output codes 
are referenced to this voltage range. If the references are raised 
by one volt, then the meanings of all of the output codes are 
also raised by one volt. That is, it takes one more volt on the 
input to get a particular code. Alternatively, one volt can be 
subtracted from the signal before the ADC to achieve the same 
effect To get a particular code, the input signal must be one 
volt higher. The ADC acts as a buffer with differential inputs. The 
input minus the reference determines the output code Adding 
a signal to the user's signal can be accomplished by either 
summing the two signals before the ADC or by subtracting the 
signal from the ADC references Therefore, the dither signal can 
be miected by subtracting it from the ADC references. 

The dither signal desired has increments of LSB/4 between 
samples to achieve an equivalent resolution of eight bits With 
LSB/4 steps, the period of the dither signal should be four sam- 
ples. The HP 541 1 1 D uses four ADCs sampling 90 degrees out 
of phase to achieve a 4 x improvement in sample rate Therefore, 
if the dither works correctly, each ADC will "see" the same point 
of the dither signal each time it samples, Since the period of the 
dither signal is 4/f s , all frequencies within the signal are at multi- 
ples of f 5 /4. Each ADC is sampling at f s /4, so the dither signal 
spectrum appears at multiples of the individual ADC sampling 
frequency. All of these frequencies will alias to dc. To each of 
the ADCs. the dither signal appears to be a dc signal Then the 
desired result can be obtained by adding the appropriate dc 
signal to Ihe references of each ADC. This gives a stable, accu- 
rate dither signal for optimum performance. 

To prevent increasing the rms quantization noise, the dither 
signal must be subtracted from the ADC output code The six-bit 
ADC code is augmented with two bits so that the four dither 



values can be represented. With the final eight-bit code, the 
mean quantization and rms errors remain unchanged 

The order of the dither values is almost as importani as the 
step size The pattern should give consistent results whether an 
edge is rising or falling The frequency content should also be 
spread out for the greatest attenuation by Ihe filters (only two 
frequencies, f„ and f,/2 are available with the given period) The 
dither pattern at the ADC reference that best fits this requirement 
is 0, !6, Va, V*. This gives dither bits of 00, 10, 01, 11 Note that 
the larger part of the signal, the MSB, is changing at f„ while the 
LSB is changing at f n /2 

Dithering improves all modes of operation m the HP 541 11D 
The seven-bit and eight-bit modes are described In the accom- 
panying article. In the six-bit (real-time) mode, a nonlinear smooth- 
ing filter improves the accuracy of low-slew-rate signals with only 
a minimal effect on fast edges. The filter averages four samples 
(two before and one after the current sample) If the average is 
within LSB/2 (six-bit) of the current sample, then the current output 
is the average Otherwise the current output is the current sample 
changed toward the average by LSB/2. Although the smoothing 
filter would work with random noise, the quality of Ihe dither signal 
improves the performance 

In repetitive mode, dithering improves averaging by providing 
a resolution of LSB/4 with repeated acquisitions Persistence is 
the mode with the least improvement Dithering allows all levels 
of the screen to be filled in and increases the accuracy of some 
measurements made by the user For example, an eye pattern 
is a multivalued function, so averaging mode does not work 
correctly. When the user measures a portion of the waveform, 
the dithering provides LSB/4 resolution 

Dithering could be carried further, providing LSB/8 or LSB.' 16 
resolution with a longer signal period. However, since real ADCs 
are not perfect, a practical limit exists for any ADC. The ADCs 
m the HP 5411 ID are tested to LSB/4 accuracy, or one LSB at 
eight bits. With this specification, dithering to LSB/4 resolution 
is an improvement over LSB/2. but LSB/8 would not provide a 
significant advantage (particularly since the HP 54111D's four- 
ADC scheme accomplishes LSB/4 dithering so easily) 



filter untouched. The analog noise is white, so it is reduced 
by a factor of 4. 

For many signals, quantization noise will appear to be 
white noise, uncorrelated with the input. For the moment, 
assume that this is the case. Then the filtering process will 
reduce the quantization noise by a factor of 4. Since both 
noise components have been reduced by 4, the SNR has 
been improved by a factor of 4, or one effective bit. Simi- 
larly, the eight-bit filter has a bandwidth of one sixteenth 
of the Nyquist frequency and improves the SNR by a factor 
of 16, or two effective bits. 

Statistically, this can be explained as follows. The filter 
is an FIR (finite impulse response) filter. The filter performs 
a weighted moving average on the input to find the out- 
put. Let the filter have N coefficients, where N is odd, and 
let M = (N - l)/2. The filter coefficients are h,, where 
i = -M M. To achieve unity gain at dc: 

M 

, 2 M h, = 1 (8) 



The filtering equation for a symmetrical filter is: 

v k = S h iXk<1 (9) 

1 = — M 

A simple example of this would be the filter defined by 
the coefficients 0.25, 0.50, 0.25. With input x and output 
y, the value for y k is 

y k = (l/4)x k _, + (l/2)x k + (l/4)x k _, (10) 

If the filter input x is the noise, the noise at the output is: 

s* = var(y k ) = var J§^ itPk+i J 

= | h?s? + 22 h i h j [cov(s,.s,)] (U) 

i=-M j = i+l 
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where var is variance and cov is covariance. The noise 
characteristic for all samples is identical, so s, = s ln . If 
the noise is white, all noise samples are independent, so 
cov(Sj, S,) = 0. The equation reduces to: 

s 2 = i hfs? n = sf„ g h? (12) 

i=-M I--M 

The output noise power is the input noise power times 
the sum of the filter coefficients squared. To gain an effec- 
tive bit, this sum must be less than 0.25. This is not difficult 
to achieve since all of the coefficients have a magnitude 
less than one and most are positive. The filter given above 
reduces the noise to (1'4) 2 + (1/2) 2 + (1/4) 2 = 37.5 percent 
of its original value. However, the filter has poor passband 
characteristics so that the signal is also reduced at all fre- 
quencies but dc; the SNR is improved by 8/3 only at dc. 

When is the overall system noise white? This occurs 
when either white analog noise or white quantization noise 
dominates. With either of these cases, the improvement in 
SNR is straightforward. Quantization noise is generally 
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Fig. 4. Interpolation by a factor of 4 (a) Example signal spec- 
trum before sampling, (b) Spectrum after sampling at fre- 
quency I,,, (c) Sampled data points, (d) Adding three zeros 
between points shown in (c). (e) New spectrum (t) Desired 
low-pass filter (g) Final result. 



considered to be white whenever the signal changes by at 
least a few quantization levels between samples. In this 
case, the error from one sample to the next will take on a 
random pattern and the error signal will appear to be un- 
correlated. In general, the more bits in the ADC. the higher 
the signal frequency, and the larger the signal amplitude, 
the more random the quantization noise. 

Subharmonics can cause problems creating white noise, 
depending upon the order of the subharmonic and the filter 
length, For example, an input at one eighth the sampling 
frequency f s will give at most eight different codes (in the 
absence of analog noise). The quantization error for every 
eighth sample is correlated. If the filter has a length longer 
than eight, then the covariance term in the noise equation 
is nonzero. The worst-case signal for generating white 
quantization noise is dc with no analog noise. For this 
signal, the ADC outputs the same code for each sample 
and the quantization error is exactly the same (this is the 
degenerative case of a sampling frequency subharmonic). 
The quantization noise signal is a dc signal — certainly a 
poor approximation of white noise. For a dc signal, the 
covariance term in equation 11 exactly cancels the increase 
in the SNR. 

In the HP 5411 ID, this problem is solved by adding a 
dithering signal to the analog signal before the ADC (see 
the box on page 72), The signal is precisely designed to be 
three fourths of an LSB peak to peak with its entire fre- 
quency content in ihe stop band of the low-pass filters. 
Sampling takes place so that four successive conversions 
sample the dither signal at LSB/4 intervals. The six-bit code 
from the ADC is augmented with two bits to cancel the 
error from the dither signal. For a dc signal, four successive 
samples give a resolution of LSB/4 for eight bits of resolu- 
tion. For example, assume that an ADC has a Q level of 
one volt and that the code is expressed in volts such that 
code N represents N volts and will be present whenever 
the input voltage is s*(N-0.5) volts and <(N + 0.5) volts. 
If the input voltage is 1.5V, then the ADC outputs code 2 
for an error of 0.5V. If dithering is added, then analog 
voltages of 0, -0.5. -0.25, and -0.75 volt are added to 
successive samples. The voltages lhat the ADC sees are 1.5. 
1. 1.25, and 0.75 volts so that it outputs codes 2, 1, 1, 1. 
Note that the average value of the samples is 1.25V. The 
average value of the dither signal is - 0.375V. so the average 
value of the signal is 0.375 volts higher than indicated. 
That is. the measured voltage is 1.625 volts for an error of 
0.125V. 

This example gives the maximum error for both cases. 
This is a factor of 4 in voltage, 16 in power, or two effective 
bits! If we augment the output codes with two bits to cancel 
the dither, then the outputs are 2, 1.5, 1.25, and 1.75 for 
an average value of 1.625 volts. When the outputs are av- 
eraged (filtered), the extra two bits neither help nor hurt 
the resolution; the extra resolution comes from providing 
enough extra code thresholds over four samples to match 
the number of thresholds for an eight-bit ADC. 

In the frequency domain, the dithering signal pushes 
most of the noise at dc (15/16 of it) to the dithering frequen- 
cies. Since these frequencies are in the stop band of the 
filters, the noise is eliminated from the output. Examination 
of the dithering characteristics in equation 11 shows that 
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the covariance term is negative so that the overall improve- 
ment is better than with white noise. 

Under conditions where white noise dominates the 
input, the dithering signal has no effect: it simply appears 
as a high-frequency signal that is filtered out by the low- 
pass filter. Hence, the dithering has taken the worst-case 
signal and turned it into the best-case signal without hurt- 
ing the original best case. A signal at any frequency is 
improved in relation to the quantity of noise that is corre- 
lated. 

The filters used in the HP 5411 ID do not come close to 
ideal low-pass filters (see the box on page 75). The goal 
was to optimize SNR improvement while maintaining a 
good step response. The desired SNR improvement deter- 
mines the minimum stop-band width: 3f„/4 for the seven- 
bit filter and 15f,,/16 for the eight-bit filter. The stop-band 
attenuation is not critical: above 20 dB it is easier to improve 
the SNR by widening the stop band than by increasing 
the attenuation. (Infinite attenuation is not much improve- 
ment when the major source of noise is in the passband.) 
The maximum amount of overshoot and ringing on the 
step response is known. These factors determine the width 
of the transition band and, in turn, the passband. To provide 
the required SNR improvement over as wide a frequency 
band as possible, the passband is as flat as possible. The 
bandwidths of the seven-bit and eight-bit filters are 100 
MHz and 25 MHz at full sample rate and they provide one 
and two extra effective bits to frequencies of 83 MHz and 
18 MHz. respectively. 

The seven-bit and eight-bit filters in the HP 54111D, 
along with dithering, provide increases of one and two 
effective bits at the expense of bandwidth. In Fig. 1, the 
filters move the operating point at least one or two ADC 
lines up as a result of reducing quantization noise and one 
or two tics (6 or 12 dB) to the left as a result of reducing 
analog noise. Note that any noise in the user's signal is 
included whenever measurements are made and that the 
filters reduce this noise too. The HP 5411 ID has better 
effective-bits performance in the seven-bit filter mode than 
it would have with an eight-bit ADC without filters, al- 
though a penalty is exacted in lower bandwidth. 

Interpolation Process 

One of the most questioned functions of digitizing oscil- 
loscopes is interpolation. Users feel uncomfortable with 
the "fictitious" data that is placed between samples to fill 
in the waveform. Most interpolators are linear functions 
that can be described in the same sequence of steps. This 
includes common filters such as sin(irx)/trx (i.e.. sinc(x)), 
Gaussian, and Nth-order polynomials that fit the sample 
points exactly (LaGrange interpolation). A specific example 
of interpolation by a factor of four will demonstrate the 
steps in the frequency domain. 

For simplicity, assume that the analog signal has a trian- 
gular spectrum as shown in Fig. 4a. After sampling the 
signal, the spectrum is repeated at intervals of f s . the sam- 
pling frequency, as in Fig. 4b. Mathematically, any combi- 
nation of the spectrums could account for the sampled 
waveform. The purpose of interpolation is to increase the 
sample density as though the signal were sampled at a 
higher rate. The first step in interpolation is to increase 



the sample density. Zero-valued data points are inserted 
between the actual data points (Fig. 4c) to get the desired 
density. For the example, three zeros are added to increase 
the density by a factor of four (Fig. 4d). In the frequency 
domain, this simply places the new sample rate, f^, at 4f s 
and the new Nyquist frequency f,', at 2f s (Fig. 4e). There 
are now four images of the original spectrum below the 
Nyquist frequency. This means that a combination of spec- 
trums is required to produce the zero-filled signal. The 
SNR is 1/3, since there are three undesired spectrums to 
the one desired spectrum. The final step is to filter the 
result to leave the original spectrum at a higher effective 
sample rate (Figs. 4f and 4g). This low-pass filter is often 
a sine function or a Gaussian function. 

The most difficult concept of the process to understand 
is the spectrum of the zero-filled signal. A simple example 
may clear up the confusion. The signal 

'A + cos(2irf s t)/2 + cos(4irf s t)/4 

can be used to modulate the original signal before it is 
sampled (Fig. 5a). If the signal is sampled at frequency f^ 
in phase with the modulation signal, the result is identical 
to Fig. 4d. The spectrum of the analog signal before the 
digitizer is the convolution of the original signal spectrum 
(Fig. 5b) with the spectrum of the modulation signal (Fig. 
5c), giving the "zero-filled" spectrum (Fig. 5d). After sam- 
pling, the energy from 2f 5 to 2.5f 5 aliases back to the range 
2f s to 1.5f s . doubling the height of the short triangle. Sam- 
pling also causes this new spectrum below 2f s to be copied 
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Fig. 5. (a) Multiplying input signal by modulation signal Orig- 
inal spectrum (b) convolved with modulation signal spectrum 
lc) gives a "zero-lilled" spectrum (d) The result alter sampling 
at ljj fs identical to Fig. 4e. 
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Digital Filters 



The digital fillers m the HP 541 11D are FIR (finite impulse 
response) fillers The FIR tiller has no unsampled analog counter- 
part since il has no poles — only zeros One of the advantages 
of FIR filters is thai they can easily Be designed with perfectly 
linear phase, providing a constant delay through the filler regard- 
less of frequency. Constant delay is important for making accu- 
rate measurements Without constant delay, the apparent time 
between two waveform edges could be a function of the differ- 
ence m edge speed For example m a digital circuit this might 
be a function of logic family or signal loading. 

One characteristic of linear phase is that the step response is 
antisymmetrical, the preshoot is a mirror image of Ihe overshoot 
In an oscilloscope, this appears as noncausal behavior since 
the filtering routine takes Ihe constant delay into account when 
placing the filtered points on screen Although this is unsettling 
when first encountered, it is an advantage over an analog filter 
since the user can often separate the matching overshoot/pre- 
shoot response from the actual signal overshoot 

Another characteristic of the digital filters is that the bandwidth 
is related to the sample rate When the sample rate changes, 
the bandwidth changes Some users consider this an advantage 
and some a disadvantage. Generally speaking, the sample rate 
is not lowered unlil the screen shows only one or two periods of 
a waveform. In this situation, the lower bandwidth does not ap- 
preciably alter the signal, but still provides the increase in effec- 
tive bits For example, at 100 ns'div. the sample rate drops to 
500 megasamples per second. Now the seven-bit filler has a 
bandwidth of 50 MHz and a rise time of approximately 7 ns. The 
largest change in response occurs for a perfecl step, where one 
sample is on ihe boltom of Ihe step and the next is on Ihe top 
In this case, the filtered rise time will be 7 ns, or less than a tenth 
of a division The rise lime for the eight-bit filter will be about 30 
ns, or one third of a division 

The FIR fillers achieve high throughput by using lookup tables 
A table exists for each filler with each coefficient value multiplied 
by all possible data values At run time, Ihe microprocessor need 
only perform lookup and add operations. The sizes of the tables 
were minimized by taking into account properties of Ihe filters 
and data The linear phase characteristics are achieved by mak- 
ing Ihe filler symmetrical about Ihe center coefficient This allows 
a reduction in size by a factor ot two. The eighl-bil data values 
are shifted lo a Iwo's complement formal to give 128 nonzero 
magnitudes instead of 255 Altogether a data reduction of nearly 
four to one was accomplished Further reduction is possible, but 
would lower Ihe throughput of the inslrument Throughput could 
have been increased by using a math coprocessor or a digital 
signal processing chip, but this was not done because of cost 
and space considerations. 

Many of the limitations lo digital filters parallel limitations to 
analog filters. The characteristic of most concern is that the steep- 
ness of Ihe transition region between the passband edge and 
Ihe stop-band edge determines the amount of overshoot and 
ringing in the step response. The filters in the HP 541 1 1D were 
designed with this in mind, trading off bandwidth for pulse fidelity 



at intervals of 4f s (f j). giving a sampled spectrum identical 
to Fig. 4e. 

The requirements for the HP 54 1 1 ID s interpolation filter 
are more stringent than for the seven-bit and eight-bit fil- 
ters. The filter can interpolate up lo a factor of 100. For 
this case, the SNR before low-pass filtering is 1/99. or -20 
dB. The filter stopband must attenuate the 99 spectral rep- 
licas to achieve an SNR of 42 dB. This minimizes the reduc- 
tion in effective bits and the measurement uncertainty in 
both time and voltage. Since there is no anti-aliasing filter 
in the instrument (to allow a 500-MHz repetitive band- 
width), the stopband must begin near the Nyquist fre- 
quency of 500 MHz. To maintain good pulse fidelity, the 
maximum amount of overshoot and ringing is determined. 
As with the other filters, these quantities determine the 
width of the transition band and the passband. The filter 
is down 2 dB at 250 MHz and 48 dB at 515 MHz with only 
6% overshoot. 

The sinc(x) and Gaussian filters often used are extreme 
types of filters. The sinc(x) filter gives the maximum 
bandwidth (f„). but has a poor step response. The Gaussian 
filter adds no overshoot to the signal, but requires a very 
wide transition band, giving an unnecessarily narrow pass- 
band. The HP 5411lD's interpolation filter is somewhere 
between these two extremes. One advantage to having a 
bandwidth of f s /4 rather than f s /2 is that the transition region 
provides some anti-aliasing protection. Any signals in the 
500-to-750-MHz range will alias to the 250-to-500-MHz re- 
gion, The attenuation of the transition band will perform 
an anti-aliasing function for this frequency range. Beyond 
750 MHz the attenuation of the analog signal path provides 
aliasing protection. 

A side effect of having a bandwidth lower than Ihe 
Nyquist frequency is that filtered points do not always 
overlay the actual sampled points. Many users believe thai 
this is a shortcoming of the interpolation, although it is a 
necessary phenomenon. Any movement in samples results 
from reduced bandwidth after Interpolation; no one would 
expect Ihe bandwidth-limited trace on an analog oscillo- 
scope to look like the full-bandwidth trace. To leave the 
actual samples in place, the filter's impulse response must 
have the exact zero crossings of the sinc(x) filter and its 
characteristics will approach those of the sinc(x) filter. II 
is very limiting on the filter design and fidelity to enforce 
such a requirement. 

Interpolation is not a difficult process, but must be done 
with care. The result is consistent with the response of the 
interpolation filter so long as aliasing is not a concern. 

Summary 

Digitizing a waveform includes two separate effects: sam- 
pling and format conversion. Sampling is an analog process 
that transports the signal from the s-plane to Ihe /.-plane, 
resulting in the possibility of aliasing. The format conver- 
sion simply changes Ihe signal loa form that can be handled 
by a computer, adding noise in the process. If aliasing can 
be ignored, Ihe digitized data contains all of the information 
necessary lo reconstruct the signal, with a little added 
noise. For any measurement system, the signal-to-noise 
ratio is an important parameter. With digitizing instru- 
ments, this is more important than Ihe actual number of 
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bits or the resolution of the display. Since digitized data 
is merely a different representation of a waveform, the same 
techniques that have been used to improve the SNR in 
analog systems can be used with digital systems. The seven- 
bit and eight-bit filters are digital equivalents of the 
bandwidth-limit button on traditional analog oscillo- 
scopes. 

A function that has often been questioned by digitizing 
oscilloscope users is interpolation. Waveform fidelity is 
highly dependent upon the quality of the interpolation 
filter. There is a direct trade-off between the bandwidth-to- 
sampling-rate ratio and the quality of interpolation. The 
HP 5411 ID does not present the best filter for all types of 
signals, but it does provide a high-bandwidth, good-quality 
filter that works very well for the types of signals viewed 
most often on oscilloscopes — sine waves, pulse trains, and 
logic signals. 

The digital filters in the HP 541 llD provide benefits that 
are not easily available in analog counterparts. First, the 
filters have no delay at any frequency and timing measure- 
ments are not altered because of nonlinear phase charac- 
teristics. Second, a stable system response is achieved, not 
only from unit to unit, but with time and temperature. 
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